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VERTICAL AND HORIZONTAL PROCESSES IN 
PROBLEM SOLVING * 


HOWARD H. KENDLER 
New York University 


The present paper is concerned with 
an approach—and not the approach— 
to the universally appealing but never- 
theless unpopular research area of 
problem solving. Problems of problem 
solving have proved to be particularly 
refractory to psychologists. More 
often than not the uncommon re- 
searcher with the temerity to attack 
some aspect of reasoning retreats to 
more secure and conventional problems 
when he discovers that his sorties fail 
to achieve any impressive victory. As 
a result the literature of problem solv- 
ing is almost chaotic because it is so 
heavily sprinkled with isolated bits of 
information (Duncan, 1959). 

Perhaps the present stage of de- 
velopment of psychology does not 
justify the strategy of investigating 
such a complex phenomenon. Fortu- 
nately, or not, science has no built-in 
traffic lights to inform investigators 
when to proceed. It may be a risky 
and potentially unfruitful gambit to 
investigate problem solving but then 


1An earlier version of this paper was 
delivered by Howard H. Kendler as an in- 
vited address to the 1960 meeting of the 
Eastern Psychological Association, which 
was held in New York City. The authors 
are indebted to the Office of Naval Research 
and the National Science Foundation for 
their support of the research reported in 
this paper. 


TRACY S. KENDLER 
Barnard College 


again it may not be. In addition to 
the intrinsic interest of the area it does 
offer a challenge to those psychologists 
who are interested in testing the gen- 
erality of any set of theoretical prin- 
ciples stemming from other areas of 
behavior (e.g., learning, perception). 

This paper initially will make fleet- 
ing references to some methodological 
problems with which a researcher in 
the field of reasoning must contend. 
Then a simple pretheoretical model of 
problem solving will be described, fol- 
lowed by a report of research which 
the model generated, and which in turn 
is shaping the model itself. 


METHODOLOGICAL PROBLEMS IN PROB- 
LEM SOLVING RESEARCH 


Anybody who does research is—or 
should be—aware that every decision 
he makes cannot be justified by facts 
or logic. Some decisions must be 
made on the basis of personal intuition. 
This is particularly true for the re- 
searcher in problem solving who must 
make three strategic decisions which 
cannot help but have profound influ- 
ences on his research and the ideas 
they generate (Kendler, 1961). These 
decisions, which are not completely in- 
dependent, are related to the place of 
problem solving in psychology, the use 
of complex or simple experimental 
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tasks, and the selection of a pretheo- 
retical model to guide research. Con- 
sidering the volitional nature of these 
problems, as well as the current status 
of psychological knowledge, it would 
be both inappropriate and erroneous to 
consider these methodological problems 
as offering only one sensible alterna- 
tive. Adopting this point of view 
would do much to minimize the need- 
less disputation that seems to peren- 
nially surround matters of research 
strategy. 

Accepting the principle that a basic 
research strategy is not simply an out- 
growth of logical and factual consid- 
erations does not reduce one to making 
decisions in either a haphazard or ran- 
dom manner. A given strategy can be 
adopted on the basis of rational con- 
siderations as long as it is realized that 
other reasonable attitudes might lead 
to the adoption of different decisions. 

The history of problem solving in 
particular and psychology in general 
suggests that problem solving can best 
be conceptualized not as a basic psycho- 
logical process, but instead as one that 
reflects the interaction of more funda- 
mental processes (e.g., learning, per- 
ception, and motivation). 

If problem solving is not viewed as 
a unitary process, how is an appro- 
priate experimental situation selected 
to investigate it? One possibility is 
that a problem can be selected from a 
“true life” situation such as trouble- 
shooting electronic equipment. Or 
problems can be invented (Duncker, 
1945; Maier, 1930) that capture the 
flavor, if only partially, of problems 
we meet in everyday life. 

A more analytical approach can be 
taken to the selection of an experi- 
mental situation to investigate problem 
solving. If problem solving is com- 
pounded of elementary behavioral proc- 
esses, then it may be more strategic to 
devise some simple problems in which 


the relationships of fundamental psy- 
chological mechanisms to problem solv- 
ing are. highlighted. That is, tasks 
should be devised not to duplicate or 
imitate everyday problems, but instead 
to isolate and magnify the basic mech- 
anisms that operate in such complex 
tasks. 

This analytical approach which is 
favored by the authors suffers from one 
major drawback. How is it possible 
to know the basic mechanisms of prob- 
lem solving prior to their discovery? 
Obviously, excepting divination, there 
is no method. But this does not pre- 
vent the analytical approach from op- 
erating. The researcher can prejudge 
theoretical issues by formulating a 
model of what he guesses problem 
solving to be like. The model can 
guide the investigator in selecting the 
hypotheses to test, as well as the expe- 
rimental situations in which to test 
them. 

This brings us to the third and most 
important decision a problem solving 
researcher has to make: his choice of 
a pretheoretical model (Koch, 1959). 
A pretheoretical model is not equiva- 
lent to a theory. The criterion of 
validity cannot properly be applied to 
it because essentially a pretheoretical 
model is an informal conception that 
operates as an analogy (Lachman, 
1960). It is conceivable that different 
models (e.g., learning, perception, in- 
formation theory) can all lead to fruit- 
ful and valid theories of problem solv- 
ing. 

Psychologists have many possibilities 
from which to choose their model. 
These models can be conveniently di- 
vided into two main categories: the 
empirical model that springs primarily 
from experimental data, and the for- 
mal model .that is usually generated 
by mathematical or logical systems. 
Among the empirical models that have 
achieved some acceptance are those 
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that are based on introspective findings 
(e.g., the four successive stage model 
of “preparation,” “incubation,” “inspi- 
ration,” and finally “verification’”), 
the facts of perception, and those of 
learning. Some formal models used 
are those dependent upon stochastic 
models, game theory, and the operation 
of computers. 

The present authors adopted an S-R 
learning pretheoretical model. The 
decision no doubt was influenced by 
professional training and past research 
efforts. But other considerations en- 
tered. For the past 4 decades S-R 
learning psychologists have probably 
been the most active experimental and 
theoretical group in psychology. To 
some, if not a large, extent this can be 
attributed to the fruitful and cleansing 
effect S-R language has upon design- 
ing, reporting, and interpreting re- 
search. S-R language forces the psy- 
chologist to focus his attention on ob- 


jectively defined environmental and 
behavior variables and thus encourages 
the collection of data and the testing of 


ideas. The efforts of S-R learning 
psychologists have supplied a host of 
facts, concepts, and hypotheses that 
can be exploited in an exploratory ex- 
cursion into the realm of problem 
solving. 

The facts and theories of learning, 
however, do not spontaneously coalesce 
to form a model that can guide re- 
search in problem solving. Some se- 
lection must be made. S-R learning 
theory does not represent a single or- 
ganized formulation. Anyone who is 
familiar with the systematic orienta- 
tions of Hull (1952), Guthrie (1952), 
Spence (1956), and Skinner (1953) is 
aware of this. Many of these syste- 
matic differences, however, become at- 
tenuated and some even disappear 
when viewed from the distance of prob- 
lem solving behavior. It is possible 
and perhaps even profitable to de- 


velop a learning model for problem 
solving that ignores many of the points 
of disagreement among S-R theories. 

Much of the objection to S-R lan- 
guage stems from the apparent dis- 
crepancy between active, flowing be- 
havior and the inert, static, single S-R 
association. Using S-R language does 
not mean that complex behavior ac- 
tually consists of S-R_ connections. 
After analyzing the concept of light, 
Toulmin (1953), concludes: “We do 
not find light atomized into individual 
rays: we represent it as consisting of 
such rays” (p. 29). Applying the 
same idea to the concept of the S-R 
association : “We do not find behavior 
atomized into individual S-R associa- 
tions: we represent it as consisting of 
such S-R associations.” The concept 
of the S-R association, therefore, must 
be judged not in terms of its ability to 
provide a clear image of behavior, but 
rather in its capacity to represent the 
facts of behavior. 


PRETHEORETICAL MopEL OF PROBLEM 
SOLVING 


An S-R model needs to represent 
two important characteristics of prob- 
lem solving behavior. These charac- 
teristics are behavior is continuous, and 
at any one time behavior consists of 
several habits. The terms “horizontal” 
and “vertical” are used to refer to these 
processes ; horizontal to the continuity 
of behavior against the dimension of 
time, and vertical to the assumption 
that independent levels of behavior 
(i.e., S-R units) occur simultaneously. 

The assumption that S-R associa- 
tions do not occur in isolation, but 
instead are linked together to form 
integrated, continuous behavior goes 
back many years (e.g., Watson, 1913). 
Today the process is most commonly 
referred to as chaining. Skinner 
(1953) and his associates have de- 
veloped powerful techniques that shape 
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salivation 


Fic. 1. An S-R representation of 


classical conditioning. 


behavior into long, complicated chains. 
The mass of data they have collected 
suggests important principles govern- 
ing habit chaining. There is litle doubt 
that when their quasitheoretical system 
is exploited fully with autoinstructional 
devices that important insights into 
problem solving behavior will emerge, 
particularly in relation to how an added 
bit of knowledge can trigger problem 
solution. The kind of chaining with 
which the Skinnerians have dealt (i.e., 
adding new S-R units to an already 
functioning chain) does not exhaust 
all the problems associated with the 
horizontal processes of problem solv- 
ing. Of particular importance to prob- 
lem solving is the spontaneous integra- 
tion of separate habits which occurs 
when an organism infers the conse- 
quences of combining previously inde- 
pendent S-R units. This kind of 
chaining was investigated in Kohler’s 
(1925) classical studies of insight and 
in the more controlled reasoning ex- 
periments of Maier (1930). More re- 
cently the authors (Kendler & Kendler, 
1956, 1961; Kendler, Kendler, Plis- 
koff, & D’Amato, 1958) have tried to 
identify some of the important vari- 
ables that enable children to combine 
separate experiences in order to solve 
an inference-type problem. Much of 
the research reported in this paper will 
be concerned with how mediated stimu- 
lus and response events aid in the for- 
mation of problem solving chains. 
The assumption of vertical processes, 
ie., the organism responds several dif- 


ferent ways at any one time, is also 
not a novel one. Every psychologist 
is aware that organisms make several 
different responses simultaneously, al- 
though typically only one is attended 
to. Sometimes the different responses 
are interrelated, as is the case between 
the heart and respiration rates of a 
fearful organism. In other cases the 
different responses are independent, 
e.g., a person’s conversation is unin- 
fluenced by his tugging at his ear lobe. 
The best laboratory example of vertical 
processes, and one that has much rele- 
vance to problem solving, is shown in 
Figure 1. Those familiar with intro- 
ductory psychology textbooks will 
recognize this diagram as representing 
classical conditioning. Notice that the 
two solid lines indicate independent 
S-R units which are operating simul- 
taneously. One is the tone that ini- 
tiates the “investigatory” response, and 
the other is the food which elicits sali- 
vation. Initially these two associations 
operate in a parallel fashion, but as a 
result of their simultaneous occurrence 
an interaction takes place which is ex- 
pressed by the broken line representing 
the acquired conditioned response. 

Obviously the brief reference to hori- 
zontal and vertical processes in which 
it is assumed fundamental S-R prin- 
ciples operate (e.g., discrimination, ' 
generalization, etc.) presents at best 
the barest skeleton of a model of prob- 
lem solving. It needs the flesh and 
skin of experimental facts to give it 
solidity and theoretical principles to 
clothe it in scientific respectability. 
Let us now review some of the prog- 
ress that has been made in this direc- 
tion. 


Concept LEARNING AND UTILIZATION 


Although the primitive model just 
described fails to generate any research 
by itself, it does suggest that individual 
experiments cannot be directed at 





VERTICAL AND Hor1izONTAL PROcESSES IN PROBLEM SOLVING 5 


problem solving in its entirety. There 
are too many aspects to this phenom- 
enon. The researcher, in designing an 
experiment, must scan the entire prob- 
lem solving process and then focus 
upon that segment that promises to 
yield fruitful results and is also amen- 
able to investigation. 

For reasons that will become evident, 
it was decided to compare reversal and 
nonreversal shifts in a simple concept 
learning task. Figure 2 characterizes 
each kind of shift by showing a simpli- 
fied version of an experimental situa- 
tion used with children. The stimuli 
(cups) for their first discrimination 
differed simultaneously on two dimen- 
sions (size and brightness). The sub- 
ject is rewarded for responses to one 
dimension (e.g., large cup is positive, 
small cup is negative). The other di- 
mension is irrelevant. After learning 
the first discrimination, the subject is 
forced to shift to another response. In 


a reversal shift the subject is required 
to respond to the same dimension on 
which he was originally trained, but his 
overt choice has to be reversed, e.g., 
he has to shift from a large cup to a 


small one. For a nonreversal shift the 
previously irrelevant dimension be- 
comes relevant, e.g., black becomes 
positive after large had been positive. 

Buss (1953) reported that college 
students executed a reversal shift more 
rapidly than a nonreversal shift. He 
attributed this superiority to the inter- 
mittent reinforcements that retard the 
progress of a nonreversal shift. For 
example, in Figure 2,? when a subject 


2The purpose of Figure 2 is to clarify 
the meaning of both a reversal’ and non- 
reversal shift. It would be misleading to 
believe that it represents exactly the meth- 
odology of “reversal-nonreversal” studies 
reported in this paper. For all experiments 
reported, except that of Buss (1953), de- 
signs were used that controlled for fortuitous 
intermittent reinforcements effects in a non- 
reversal shift. 


is making a nonreversal shift from 
large positive to black positive, he is 
reinforced when choosing the large 
black cup in preference to the small 
white cup. This fortuitous reinforce- 
ment of the choice of the large cup 
helps maintain the size discrimination 
and hence retards the learning of the 
brightness discrimination. The rever- 
sal shift group, on the other hand, re- 
ceives no reinforcement of the previ- 
ously correct responses, since they are 
100% nonreinforced. 

This analysis is at best incomplete. 
The work of Kendler and Vineberg 
(1954) suggested that adult human 
concept learning cannot be represented 
adequately by a single-unit S-R theory 
in which the external stimulus is “di- 
rectly connected to the overt response. 
Instead, a mediational mechanism (see 
Figure 3) is required which assumes 
that the external stimulus evokes an 
implicit response which produces an 
implicit cue that is connected to the 
overt response. 

It would be useful to digress for a 


First 
Discrimination 


Second 
Discrimination 


Small Positive 


Fic. 2. Examples of a reversal and a 
nonreversal shift. 
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moment to comment about the epis- 
temological status of these inferred 
stimulus and response events which are 
enclosed in the rectangle to emphasize 
their hypothetical character. Although 
not directly observable, they are “tied 
to” environmental and __ behavioral 
events. The basic assumption of the 
mediational hypothesis, at least for the 
time being, is that the implicit stimulus 
and response events obey the same 
principles that operate in observable 
S-R_ relationships. 

The mediational hypothesis has gen- 
erated confusion. Perhaps the follow- 
ing brief statements will clarify some 
possible areas of misunderstanding. 

1. The mediational hypothesis is 
neither new nor revoluntionary. Meyer 
(1911) and Watson (1913) referred 


to it, and Hull (1930) gave it a more 
formal status by coining the concept 
of the “pure stimulus act.” Guthrie 
(1952) has always laid heavy stress 
on a mediational-type hypothesis when 
emphasizing the importance of proprio- 
ceptive stimulation in learning. 

2. The implicit stimulus and_re- 
sponse events need not be conceived 
as having an existence independent of 
their relation to independent and 
dependent variables. These implicit 
events are theoretical constructs. Their 
epistemological status is closer to such 
concepts as drive and habit than to 
directly observable stimulus and_ re- 
sponse events, 

Some mediating events can conceiv- 
ably and probably will be coordinated 
to introspective reports, language be- 
havior, muscular movements, and other 
observable events. Coordinations of 
this sort can be useful in developing 
mediational theory. But such coordi- 
nations are not essential to mediational 
theory. The fact that genes are not 


SINGLE UNIT THEORY 
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Fic. 4. A single unit and mediational S-R analysis of a reversal and nonreversal shift. 
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directly observable (at least according 
to the geneticists consulted) does not 
interfere with their theoretical and 
practical usefulness. Even if it were 
possible to observe a gene directly, it 
would be necessary to distinguish be- 
tween it as an observable entity and 
as a concept within a nomological net- 
work. It would be unwise, and strate- 
gically shortsighted, to identify media- 
tional events with introspective reports 
or language behavior, or other observ- 
able events. The “validity” of the 
mediational mechanism does not de- 
pend on being coordinated with ob- 
servable events, but depends instead on 
being utilized in a successful explana- 
tory system. 

Figure 4 characterizes reversal and 
nonreversal shifts in terms of both a 
single unit S-R analysis and a media- 
tional one.’ It would be predicted, 
according to a single unit hypothesis, 
that if fortuitous intermittent rein- 
forcements were eliminated from a 
nonreversal shift, it would occur more 


3 Figure 4 highlights the problem of what 
are the effective stimuli that are associated 
to the overt response in both a reversal and 
nonreversal shift. It is not intended to be 
a detailed analysis of which there may be 
several alternatives. For example, in a 
single unit theory the habit to choose the 
large container might result from learning 
two separate specific habits (e.g., the choice 
of a large black container when coupled 
with a small white one and the selection 
of a large white container when paired with 
a small black one). Another possibility, 
which would be consistent with Spence’s 
theory (1936), is that the response is to 
the effective stimulus large since responses 
to the other features of the environment are 
not consistently reinforced. Similarly adult 
subjects in a reversal shift might use the 
mediator size or large or both. The effec- 
tive stimulus which is controlling the organ- 
ism’s response must be determined by ex- 
perimentation. The point made here is that 
the general implications of the single unit 
and mediational theories, as discussed in this 
paper, would be the same for a number of 
different effective stimuli. 


rapidly than a reversal shift. The 
reason for this is that at the time of 
the shift the difference between the 
strength of the dominant incorrect 
habit and the to-be-correct habit is 
much greater for the reversal, as com- 
pared to the nonreversal shift. Con- 
sequently more training will be re- 
quired to make the correct habit domi- 
nant in a reversal shift. According to 
the mediational theory the situation is 
entirely different. A reversal shift 
enables the subject to utilize the same 
mediated response. Only the overt 
response has to be changed. A non- 
reversal shift, on the other hand, 
required the acquisition of a new 
mediated response, the cues of which 
have to be attached to a mew overt 
response. Because the old mediational 
sequence has to be discarded and a 
new one formed, the nonreversal shift 
should be executed more slowly than 
a reversal shift. Thus, if it were pos- 
sible to eliminate fortuitous intermit- 
tent reinforcements, then the stage 
would be set for a crucial experiment 
testing the conflicting implications of 
the single-unit and mediational S-R 
theories. The results of a series of 
such crucial experiments (Buss, 1956; 
Harrow & Friedman, 1958; Kendler 
& D’Amato, 1955) have been con- 
sistent with the mediational formula- 
tion in showing that college students 
execute a reversal shift more rapidly 
than a nonreversal shift. It is im- 


4There are two possible ways of analyz- 
ing the superiority of a reversal shift over 
a nonreversal shift within an S-R media- 
tional framework. One is to simply count 
the number of new associations that have to 
be formed. As Figure 4 indicates only one 
new association has to be formed in a re- 
versal shift while two have to be formed 
for a nonreversal shift. Another possibility 
is that a mediating response is more difficult 
to extinguish than is an overt response. 
For the present the formulation can remain 
open-ended until information relevant to 
these two alternatives is gathered. 
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TABLE 1 


MEAN NUMBER OF TRIALS TO CRITERION ON 
Test DISCRIMINATION FOR SUBJECTS SCORING 
ABOVE AND BELOW THE MEDIAN ON THE 
TRAINING DISCRIMINATION 





Performance on training 
discrimination 


Above Median Below Median 
(slow learners) (fast learners) 





Group 





24.4 6.0 
9.0 15.5 


Reversal 
Nonreversa! 





portant to note that in a similar kind 
of problem rats find a nonreversal shift 
easier than a reversal shift (Kelleher, 
1956). Thus, one is forced to con- 
clude that a single unit S-R theory 
accurately represents the behavior of 
rats, while mediational S-R theory is 
required for the concept learning of 
articulate humans. 

The discontinuity between the be- 
havior of rats and college students 
directs one’s attention toward the con- 
ditions responsible for the development 
of mediational processes. Somewhere 
on a hypothetical evolutionary dimen- 
sion between the rat and college stu- 
dent there should be a point where a 
transition is made from a single unit 
to mediational control. An obvious 
place to locate this point would be in 
the behavior of young children. 

A study with kindergarten children 
(Kendler & Kendler, 1959) showed 
that these children as a group executed 
a reversal and nonreversal shift at ap- 
proximately the same rate. One might 
conclude that the point in human de- 
velopment was discovered which was 
psychologically halfway between the 
white rat and the college student, since 
the kindergarten children were neither 
responding in a single unit nor medi- 
ational manner, but instead in some 
compromise fashion. Another possi- 
bility is that the children had reached 
a transitional stage in development, in 
which the task to which they wee 


subjected led some to function on a 
single unit basis, and others to operate 
with a mediational mechanism. If half 
of the subjects respond in each way, the 
total results would have revealed no 
difference between the two kinds of 
shifts. 

The second alternative seems to fit 
the data. When the kindergarten chil- 
dren were divided into fast and slow 
learners on the basis of their perform- 
ance in the first problem (training dis- 
crimination), slow learners performed 
during the second problem (test dis- 
crimination) according to the single 
unit theory ; like rats they found a non- 
reversal shift easier. Fast learners, on 
the other hand, performed in accord- 
ance with the mediational theory; like 
college students, they found a reversal 
shift easier. These results were inter- 
preted as demonstrating that these 
kindergartners, taken as a group, were 
in the process of developing mediating 
responses relevant to this task, and that 
some were further along than others. 

If this interpretation be correct, then 
it would follow that for a group of 
younger (i.e., preschool) children a 


still smaller proportion should develop 


appropriate mediating responses. It 
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Fic. 5. Percentage of children responding 
in a reversal shift manner as a function of 
age. 
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would be expected that such a group, 
taken as a whole, would show clearcut 
evidence of the superiority of a non- 
reversal over a reversal shift. An ex- 
periment (Kendler, Kendler, & Wells, 
1960) designed to test this hypothesis 
produced results consistent with this 
prediction; like rats, nursery school 
children found a nonreversal shift to be 
easier than a reversal shift. 

In a very recent study the experi- 
mental procedure was modified so that 
after learning the initial discrimination, 
the children of 3, 4, 6, 8, and 10 years 
of age who served as subjects, had a 
choice of either responding in a rever- 
sal or a nonreversal manner. Under 
such circumstances, it would be ex- 
pected that the proportion of children 
who respond in a reversal manner 
would increase with age. Figure 5 
shows that the percentage of children 
who chose a reversal shift rose grad- 
ually from 37.5 at 3 to 62.5 at 10. 

Generalizing from all of these re- 
sults, it would seem that in their early 
development, children tend to respond 
in a manner consistent with a single 
unit S-R theory. With age, they de- 
velop a tendency to respond in a media- 
tional manner. The last study cited 
suggests that it is, or will soon be, pos- 
sible to ascertain the lawful relation- 
ship governing the course of this de- 
velopment. 

The point of these experiments is not 
to classify children into one of two 
categories: rat-like or human-like. 
Their aim is to lay the groundwork for 
experiments designed to investigate 
the mediational process itself. If one 
wants to investigate mediational proc- 
esses, does it not seem sensible to scru- 
tinize them at the time when they are 
developing? Answering this question 
in the affirmative, it was decided to 
investigate the relationship between the 
hypothesized mediational processes and 
verbal behavior—a relationship every- 


body assumes to be intimate and im- 
portant. 

Particularly relevant to this attempt 
to coordinate verbalization with media- 
tion were observations that during the 
course of the experiments just de- 
scribed, it was not uncommon for chil- 
dren to verbalize spontaneously the 
correct solution while simultaneously 
making an incorrect choice. A few 
children did this for many consecutive 
trials. This observation is relevant to 
the concept of vertical processes. Two 
chains of habits are occurring simul- 
taneously. One has to do with verbal 
response; the other with the overt 
choice. For these children the two 
chains are parallel, that is, they do not 
interact, 

Luria (1957), the Russian psychol- 
ogist, made somewhat similar observa- 
tions in his research with children. He 
explains this sort of phenomenon in the 
following way : 


In the early stages of child development, 
speech is only a means of communication 
with adults and other children. . . . Subse- 
quently it becomes also a means whereby he 
organizes his own experience and regulates 
his own actions. So the child’s activity is 
mediated through words (p. 116). 


These observations and their inter- 
pretations of noninteracting parallel 
processes point to the complex inter- 
relationships existing between verbal 
behavior on the one hand and problem 
solving on the other. If nothing else, 
they destroy the illusion that it is reas- 
onable to describe an organism as ver- 
bal or nonverbal without considering 
the problem with which it is confronted. 
The terms verbal and nonverbal be- 
come meaningful—and fruitful—when 
related to specific problem solving 
tasks. 

It would seem fruitful to investigate 
the cue function of words for children 
of two age levels. One possibility is 
that age influences problem solving 





10 Howarp H. KENpDLER AND Tracy S. KENDLER 


Second 
Discrimination 


First 
Discrimination 


— - 


L 





+ 


Fic. 6. The experimental procedure used 
to study the influence of verbal habits on a 
reversal shift 











only in so far as it leads to the acqui- 
sition of words. If younger children, 
say 4 years of age, could acquire the 
same words as 7-year-olds, they would 
solve a simple concept-learning prob- 
lem the same way. The other possibil- 
ity is that the acquisition of the verbal 
label by itself is not sufficient ; the word 
must be integrated with other be- 
havioral chains to influence problem 
solving behavior. And for this to hap- 
pen some developmental changes must 
first take place. 

In order to test these two alterna- 
tives, children of 4 and 7 years of age 
were presented with another variation 
of the reversal shift problem as shown 
in Figure 6. They initially learned a 
simple discrimination between a pair 
of stimuli that varied simultaneously in 
size and brightness. In the illustration 
provided in Figure 6, the large black 
square is correct. While they were 
learning, the children were required to 
verbalize aloud the stimuli to which 
they were responding. One-third 
learned to say “large” (or “small” as 
the case may be) by the simple device 
of instructing them to tell the experi- 
menter which was correct, the large 
or the small one. Another third 
learned to say “black” (or “white’’) 


in a corresponding way. The remain- 
ing third was not required to say any- 
thing. After learning the discrimina- 
tion, all subjects were presented with 
a reversal shift. In the example de- 
picted in Figure 6, the shift is to small 
regardless of size. Thus, the group 
that initially described the correct 
stimulus as “large” had verbalized the 
relevant dimension. The verbal re- 
sponse of “black” was irrelevant to this 
reversal shift. 

Figure 7 shows the results of the 
three experimental groups for the two 
age levels. If developmental! processes 
affect the utilization of verbal responses 
in problem solving, then it would be 
expected that the three verbalization 
conditions (which produced a signifi- 
cant main effect) would influence the 
behavior of the two age groups dif- 
ferently. These results suggest, but 
not quite at a significant level, that 
there is an interaction effect. Figure 
7 shows that the younger children 
profited by making the kind of verbal 
response appropriate to a reversal shift, 
while they were hindered by learning 
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Fic. 7. The effect of verbalizations on a re- 
versal shift for 4- and 7-year-old children. 
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inappropriate verbal responses. With 
no verbalization the 7-year-old chil- 
dren who presumably were responding 
largely in a mediational manner, ac- 
complished a reversal shift much more 
rapidly than their younger counter- 
parts. But unlike the 4-year-olds, they 
did not profit from being trained to 
make the relevant responses. At 7 
years of age they are capable of making 
the response themselves and outside 
help appears to be of little use. In 
contrast, the influence of irrelevant 
verbalizations is marked. The per- 
formance of the 7-year-olds was even 
poorer than that of the 4-year-olds, 
suggesting that the interfering effects 
of being given an inappropriate medi- 
ated response are greater when one is 
capable of spontaneously generating 
the correct one (7-year-olds) than 
when one is not (4-year-olds). 

How are these data to be explained? 
Attributing differences to developmen- 
tal factors is not sufficient. It is neces- 
sary to represent developmental differ- 
ences in terms of the concepts of the 
behavior model that is being used. 
That is, if a verbal label for a young 
child does not possess the same cue 
function as it does for an older child, 
then it becomes necessary to specify 
how and why this comes about. To 
some extent this has been done by em- 
phasizing the transition from a single 
unit to a mediational system, as well 
as suggesting that with age an increase 
occurs in interaction among chains of 
different vertical levels. But obviously 
this analysis of the developmental proc- 
ess demands further theoretical and 
empirical development. 

These studies are intimately related 
to the oft-reported finding that many 
species of subhuman animals are able 
to make a fairly rapid reversal shift 
if they receive a previous series of 
such shifts. Rats (Buytendijk, 1930; 
Krechevsky, 1932) show a marked 


improvement in executing successive 
reversals, They finally reach a point 
(Dufort, Guttman, & Kimble, 1954), 
in a T maze, where they learn to go to 
a new rewarded goal after making only 
one error. Even more dramatic are 
the rapid discrimination reversals ex- 
hibited by Harlow’s (1949) monkeys. 
But fish (Wodinsky & Bitterman, 
1957) exhibit only a slight improve- 
ment in successive reversals, while 
isopods (invertebrates) show no im- 
provement (Thompson, 1957). 

Because of the necessity to use some- 
what different experimental procedures 
for different species, it is difficult to 
draw an unqualified conclusion about 
the ability of different species to trans- 
fer what has been learned from previ- 
ous reversal shifts to a new one. But 
the suggestion is strong that as you 
ascend the evolutionary scale organ- 
isms acquire a greater capacity to gen- 
erate cues that enable them to make 
rapid reversal shifts. This behavior, 
according to our analysis, borders on 
the language responses of humans. 
The main difference is that our hu- 
man subjects, except those of a very 
young age, exhibit rapid reversals with- 
out any previous reversal training. 
Whereas the human automatically 
seems to generate a mediated response 
that provides the basis for his rapid 
reversal, the anima! subject must grad- 
ually acquire an ability to respond ap- 
propriately to some response produced 
cue resulting from nonreinforcement 
of a previously correct response. 

Up to now, the reversal and non- 
reversal technique has been used to 
investigate mediational and develop- 
mental variables. It has proved suf- 
ficiently flexible to be used in a study 
(Kendler, Glucksberg, & Keston, 
1961) which was designed to lengthen 
a problem solving chain so that the 
interaction between various segments 
could be observed. In this study a 
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Fic. 8. The hypothesized behavioral chain operating at the time the subject was being 


shifted to the second concept. 


(Capital letters refer to directly observable stimulus and 


response events, while small letters refer to those that are inferred.) 


perceptual orienting S-R unit was 
added on to the mediational chain al- 
ready described. Figure 8 illustrates 
in an oversimplified manner the be- 
havioral sequence involved in this 
study in which subjects had to learn 
to press the correct button when two 
physically discrete and spatially sepa- 
rate stimulus patterns were projected 
on a screen at such a rapid rate that 
only one could be perceived on any 
trial. During the learning of each of 
two successive concepts (involving 
either a reversal or nonreversal shift), 
the subject had to pay attention to the 
relevant stimulus pattern while ignor- 
ing the irrelevant one. Thus, in order 
to make the correct overt response 
consistently, a subject initially had to 
make the appropriate orienting re- 
sponse in order to perceive the relevant 
stimulus pattern to which he had to 
make the correct mediational response 
which served as the cue for the key- 
pressing act. 

An experimental design was used in 
which, at the time of the shift from the 
first to the second concept, one group 
had already learned the appropriate 
orienting response as well as the ap- 
propriate mediating act. They needed 
only to learn a new terminal key- 
pressing response. The shift, for them, 
was easy to make. In contrast, the 
behavior of three other experimental 
groups was significantly worse. One 
group had to learn a new orienting 
response, e.g., look to the left instead 
of the right. Another group had to 
learn a new mediated response (i.e., 
they were required to make a non- 


reversal shift). The last group had to 
acquire both a new orienting and medi- 
ated response. The fact that the 
groups which were missing.one or both 
of the necessary behavior units (orient- 
ing and mediated responses) did not 
differ significantly among themselves, 
as well as being much poorer than the 
group that had both, highlights the 
problem of synchronizing the S-R units 
in a behavioral chain. The advantage 
in this study of having one appropriate 
unit without the other is at best neg- 
ligible. The reason for this is that 
reinforcement is only achieved con- 
sistently when both the appropriate 
orienting and mediating responses are 
operating. This particular study points 
to the need for discovering laws asso- 
ciated with the strengthening and 
weakening of independent S-R units 
in a problem solving chain, as well as 
the principles governing their syn- 
chronization. 

This study also highlighted a very 
basic problem in all of these reversal 
studies. This problem has to do with 
the very first correct response follow- 
ing the reversal shift. After discover- 
ing that the previous mode of respond- 
ing is erroneous, what makes the 
subject change his response, i.e., push 
the button that was previously wrong? 
Introspective reports fail to provide 
any clearcut answer and even if they 
did they would be in need of explain- 
ing (Kendler, 1961). 

One hypothesis is that the selection 
of the new correct response is due to 
the operation of a behavioral chain in 
addition to the one described in Figure 
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8. The first nonreinforcement in a re- 
versal shift sets off a chain, the conse- 
quence of which is to select the re- 
sponse other than the one that was 
previously correct. This may result 
from a number of different reasons 
(e.g., logical considerations, forgetting, 
etc.). The important point, however, 
is that the new key-sorting response 
occurs contiguously with the implicit 
mediational response appropriate to a 
reversal shift. As a result, a new as- 
sociation is formed between the old 
implicit cue and the new key-pressing 
response. 

In essence, what is being stated is 
that adult subjects, when making or 
deciding to make the first correct post- 
shift response, do not adopt the prin- 
ciple underlying a reversal shift. In- 
stead, it is assumed processes are 
operating which encourage the selec- 
tion of the correct response while an 
implicit cue appropriate to a reversal 
shift is operating. This sort of an 
analysis was described previously 
(Kendler & Mayzner, 1956) as 


sort of a James-Lange theory of problem 
solving . . . one makes the overt correct... 
response and if the appropriate symbolic cue 
is present, then problem solution will occur 


(pp. 247-248). 


Guthrie (1959) says the same thing 
more neatly: “What is being noticed 
becomes a signal for what is being 
done” (p. 186). 

Again the authors would like to 
guard against giving the impression of 
oversimplifying a terribly complex 
problem. They do not believe the 
contiguous occurrence of an implicit 
cue from one chain with the correct 
overt response from another tells the 
whole story. This new association in 
order to persist must be reinforced and 
in some manner “fit into” the subject’s 
ongoing behavioral chains. 

The emphasis on this vertical con- 
nection between a cue and a response 


from different chains is related in a 
distant way to Hebb’s (1958) stressing 
the role of “chance” in problem solv- 


ing: 


There are few scientists who have not had 
the experience of setting out to solve prob- 
lem A and ending up instead with the solu- 
tion to B. . . . This is serendipity, the art 
of finding one thing while looking for an- 
other (p. 215). 


According to the present analysis, 
serendipity results from the adventi- 
tious and contiguous occurrence of a 
cue and a response which are them- 
selves segments from different be- 
havior chains. Theoretically it should 
be possible to demonstrate this point 
experimentally by training subjects to 
respond simultaneously to two sepa- 
rate tasks. A problem then would 
have to be presented that requires for 
its solution the combination of a stimu- 
lus from one chain with the response 
from the other. In such an experi- 
mental situation, controlling the time 
relationship between the two should 
have an important effect on problem 
solving. Presumably contiguity be- 
tween the two should provide the most 
optimal conditions for problem solving 
(Underwood, 1952). The develop- 
ment of this kind of experimental pro- 
cedure should allow for parametric 
studies of the basic variables of the 
phenomenon which has commonly been 
called “insight,” as well as throw light 
upon issues raised by others (e.g., 
Cofer, 1957; Maltzman, 1955; Saug- 
stad, 1957). 

The pretheoretical model that guides 
the present research has many more 
facets that can be exploited. Only one 
will now be mentioned. Glucksberg 
(1962), for example, extended neo- 
behavioristic drive theory (Spence, 
1956) to problem solving. He used a 
functional-fixedness problem (Adam- 
son, 1952, Duncker, 1945) in which 
the correct response in the habit hier- 
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archy could either be made to be low 
or high. If the correct habit was low, 
it would be expected that a strong 
drive would retard problem solving 
because it would retard the extinction 
of the dominant incorrect response 
(Kendler & Lachman, 1958; Perin, 
1942). Since drive energizes behavior, 
a high drive should facilitate problem 
solving performance when the correct 
habit is dominant. The findings were 
consistent with this analysis. 

Because functional-fixedness prob- 
lems are often represented in percep- 
tual terms, Glucksberg was interested 
in seeing whether the same drive model 
could be applied to a simple perceptual 
recognition problem in which subjects 
were instructed to identify tachisto- 
scopically presented words as rapidly 
as possible. The results were similar 


to those reported for the functional- 
fixedness study: when the correct re- 
sponse was dominant, an increase in 


drive improved performance, i.e., the 
visual duration threshold was lowered. 
In contrast, increasing drive when the 
correct response was low in the hier- 
archy raised the threshold. 

There is obviously still much more 
work, both empirical and theoretical, 
needed to develop the model that has 
been described. At this point it may 
be appropriate to summarize the major 
points of this paper. 

There is not just one way to investi- 
gate problem solving. The researcher 
who is interested in problem solving 
has several different pretheoretical 
models from which to choose. This 
paper reported the results of a research 
program based on an S-R model in 
which the importance of horizontal and 
vertical processes were emphasized. 
Horizontal processes refer to the link- 
ing of successive S-R units into a 
behavioral chain, while vertical proc- 
esses refer to the assumption that in- 
dependent chains occur simultaneously. 


A series of experiments was reported, 
the implications of which supported 
postulating a mediational mechanism 
within a behavioral chain. By com- 
paring the behavior of human subjects 
of different ages, as well as relating 
their results to lower animals, it was 
possible to infer that as a child matures 
he makes a transition from responding 
on the basis of a single unit S-R mech- 
anism to a mediational one. Addi- 
tional data were cited that suggest the 
full impact of verbal behavior on prob- 
lem solving depends on developmental 
processes that encourage interaction 
between chains at different vertical 
levels. It was also suggested that 
problem solving begins in a simple 
concept learning task when a correct 
overt response from one behavioral 
chain occurs contiguously and adven- 
titiously with the appropriate implicit 
cue from another chain. The paper 
was concluded by citing findings that 
suggested the neobehavioristic drive 
theory which assumes that the effect 
of different levels of drive depends on 
the position of the correct response in 
the habit hierarchy is applicable to a 
functional fixedness problem as well as 
a perceptual-recognition task. 

If nothing else, it is hoped that the 
present paper demonstrates that it is 
possible to investigate problem solving 
in a systematic fashion. If more psy- 
chologists accepted this possibility and 
were willing to expend their research 
energies in the field of problem solving, 
progress in this area would be greater 
than it is today. 
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A QUANTITATIVE THEORY OF FEELING: 1960° 
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In recent years considerable evi- 
dence has been presented which is 
contrary to the theory of von Frey 
(1895) that qualitative differences in 
cutaneous sensations are dependent 
upon the selective action of specialized, 
encapsulated end organs. Much of 
this evidence is from histological stud- 
ies of skin of various parts of the body 


and leads to the conclusion that en- 


capsulated end organs cannot be found 
in hairy skin. Yet hairy skin exhibits 
sensitivity equal to or greater than that 
found in glabrous skin (Sinclair, Wed- 
dell, & Zander, 1952). 

This paper reviews studies con- 
cerned with the manner in which 
nerves terminate in skin and their role 
in cutaneous sensitivity. The histo- 
logical studies lead to the conclusion 
that nerve fibers supplying hairy skin 
terminate as bare, unencapsulated fila- 
ments in relation to three types of 
tissue: undifferentiated epithelial cells 
of the dermis and lower layers of the 
epidermis, hair follicles, and smooth 
muscle elements of the cutaneous blood 
vessels. Nafe (1934, 1942) has pro- 
posed that these bare filaments are es- 
sentially alike and hence respond to a 
common adequate stimulus, namely, 
movement either in relation to them- 


1 The first paper under this title was pub- 
lished by John P. Nafe in 1929. The present 
paper was presented in modified form at 
a symposium on Cutaneous Sensitivity at 
the Army Medical Research Laboratory, 
Fort Knox, Kentucky, February 11-13, 1960. 
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selves or to the surrounding tissue. 
These terminals differ primarily in 
that they terminate in different tissues 
of the skin. Both tactile and thermal 
sensations result when tissue is moved, 
the former by the direct action of a 
mechanical stimulus on the tissue, and 
the latter, by movement of the ther- 
mally labile smooth muscle, especially 
of the arterioles. 


HisToLoGy OF SKIN RECEPTORS 


Rose and Montcastle (1959) have 
reaffirmed the position of most text- 
book writers that the skin senses are 
mediated by specialized encapsulated 
endings. Commenting on the recent 
publications by the Oxford group 
(Weddell, Palmer, & Pallie, 1955), in 
which it is maintained that there is but 
a single type of nerve ending mediating 
cutaneous sensations, Rose and Mont- 
castle (1959) say: 


If a crisis exists in respect to evaluating 
the morphology of the endings, it is a crisis 
of abundance and not of scarcity. One hesi- 
tates to accept as a solution to the vexing 
problem of the morphology of the encap- 
sulated endings a declaration that virtually 
all morphological differences between them 
are either insignificant or due to artifacts 
of the technique (p. 390). 


If the Oxford group were the only in- 
vestigators to arrive at such conclu- 
sions, one might expect such a 
response as quoted above. The con- 
troversy as to whether or not encap- 
sulated endings mediate the various 
qualities of cutaneous sensations has 
a much longer history. 

Three major methods have been em- 
ployed in the attack on the problem of 
the histology of cutaneous sensation. 





Investigator 


Donaldson (1885) 
Goldscheider (1886) 
Haggqvist (1913) 
Dallenbach (1927) 
Pendleton (1929) 
Woolard (1935) 
Bazett (1941) 
Weddell (1941) 
Belonoschkin (1933) 
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TABLE 1 


SUMMARY OF HISTOLOGICAL INVESTIGATIONS RELATING SPECIALIZED ENCAPSULATED 
ENp ORGANS WITH CUTANEOUS SENSATIONS 


Skin area 
Sensations tested 


Glabrous 


Hairy 





Type I Investigations 


cold 
cold, warm, touch, pain 
cold, warm 
cold, warm 
| cold 
| cold, pain, touch 
cold 
cold 


cold 


forearm 

| forearm 

| forearm 

| upper arm 

| forearm 
| thigh 
forearm 
| forearm 
nipple 





Type II Investigations 





von Frey (1895) 


Strughold (1925) 
Strughold and Karbe (1925) 
Bazett et al. (1932) 


Sinclair et al. (1952) 


cold | 
cold, warm, touch, pain 
cold | 
| cold 
conjunctiva | | cold | 
prepuce | cold, warm, touch | 
| 


finger 
forearm | 
auricle 


| 

| conjunctiva | 

| cornea h 
forearm 


| mouth mucosa 


cold, warm, touch | 


cold, warm, touch 
cold, warm, touch 


Encapsulated endings 
reported 


none 
none 


| none 


none 

none 

none 

none 

Krause end bulb 

complicated plexuses with- 
out capsules 


Krause end bulb 

none 

Krause end bulb 

Krause end bulb 

Krause end bulb 

Krause end bulb, Ruffini 
cylinders 

Meissner corpuscle, 
Merkel's discs, Krause 
and Ruffini endings 

none 

none 





Type III Investigations 





Gilbert (1929) 


Gilmer and Haythorn (1941) 
Hagen et al. (1953) 


Dastur (1955) 


Winkilmann (1955) 
Woolard (1936) 


Gilbert (1929) 


Williams et al. (1929) 


Cathcart et al. (1948) 
Gairns (1955), Gairns (1951) 
Gairns (1953) 


The first involves careful sensory test- 
ing and permanent marking of the 


thigh 
epigastrium 
chest 
breast 
back 
abdomen 
finger dorsum 
232 specimens from 
hairy skin 
hairy skin 
thigh 


foot sole 


nipple 
prepuce 


nipple 


palate 
gum 


tongue 














method 


most direct approach. 
involves a determination of 


} none 


none 
none 
none 
none 
none 
none 
none 


none 
Meissner corpuscle 
Merkel’s disc 

Meissner corpuscles 
Golgi-Mazzoni capsules 
Krause end bulb 
Ruffini cylinders 
Meissner corpuscles 

4 unnamed types 


«Krause end bulb (rare) 


unnamed smaller capsules 
Meissner corpuscles 
Krause end bulb 
unnamed varieties 
Krause end bulb 
Meissner corpuscles 


The second 


points on the skin which are sensitive 
to thermal and/or tactile stimulation. 
After sensory testing and marking, the 
tissue is removed, stained, and ex- 
amined microscopically. This is the 


the number and kind of sensory spots 
found in a skin area followed by a 
histological examination of the tissue. 
The number and type of encapsulated 
end organs are related to the fre- 
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quency and kind of sensory spots, but 
no attempt is made to draw a close 
relationship between a spot and a par- 
ticular end organ as in the first method. 
The third only describes the general 
location and distribution of various 
kinds of encapsulated endings. The 
second and third methods give sup- 
portive data at best. * 
Investigations of the first type have 
been carried out at least nine times 
since the first report of Donaldson 
(1885). With the exception of the 
study reported by Belonoschkin 
(1933), these investigations have been 
carried out on hairy skin. It will be 
noted in Table 1, with one exception, 
that the investigators have failed to 
find encapsulated endings of any sort 
beneath the spots of stimulation. In- 
vestigations of the second type have 
been conducted primarily on special- 
ized tissue, such as is found in the 
conjunctiva, the prepuce, and the 
While there 


palmer side of the finger. 
is no direct correlation between a sen- 
sory spot and the type of ending be- 
neath it, in general, these investigators 
conclude that the number and type of 


encapsulated endings correspond 
rather well with the number and type 
of sensory spots reported in that tissue. 
The list of investigations of the third 
type included here is by no means ex- 
haustive. The inclusion of these au- 
thors is on the basis of their interest 
in cutaneous sensation. It will be 
noted that, with one exception, when 
specimens of skin were removed from 
the hairy regions of the body, encap- 
sulated endings were not found; 
whereas, when the specimens of skin 
were obtained from the nonhairy por- 
tions of the body, encapsulated endings 
were quite prevalent. 

In this entire list of investigations 
there are three exceptions to the gen- 
eral conclusion that encapsulated end- 
ings do not occur in hairy skin. These 


are Weddell (1941), in the first type 
of investigation; von Frey (1895), in 
the second type of investigation; and 
Woolard (1936), in the third type of 
investigation. Weddell et al. (1955) 
now contend that the structures which 
he and Woolard (1936) reported as 
being encapsulated endings were either 
unusual sections of hair follicles, or 
artifacts which resulted from rough 
handling of the specimens during the 
fixing process. This appears to be 
reasonable in view of the results ob- 
tained by the other investigators. The 
conclusion is inescapable that encap- 
sulated endings do not occur in hairy 
skin. They exist only in specialized 
skin areas as the conjunctiva of the 
eye, the mucous membrane of the 
mouth, the palmer surfaces of the fin- 
gers, and the palms of the hand, the 
soles of the feet, and the genitalia. In 
hairy skin nerves terminate as bare 
filaments. The manner of termination 
of these afferent nerve fibers has been 
investigated by Weddell, Pallie, and 
Palmer (1954), and the work of other 
investigators has been reviewed by 
Weddell et al. (1955). In general, 
afferent nerves terminate as bare fila- 
ments in relation to three types of 
structures found in the skin. These 
are: in and among the cells of the stra- 
tum granulosum of the epidermis and 
among the cells of the dermis; some of 
the largest fibers serving the skin end 
in relation to hair follicles, as filaments 
entwined about the hair shaft; and 
other smaller fibers terminate in rela- 
tion to the smooth muscle cells of the 
cutaneous blood vessels. 

Punctate sensitivity of the skin, as 
described by Blix (1884) and Gold- 
scheider (1884) is often cited as sup- 
porting evidence for the specialized en- 
capsulated receptor theory. Some 
areas or spots of skin are, indeed, more 
sensitive than others when near thresh- 
old intensities of stimulation are used. 
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However, in studies in which intensity 
of tactile (Guilford & Lovewell, 1936) 
or thermal (Jenkins, 1940) stimuli 
were systematically changed, the num- 
ber of sensitive spots varied directly 
with the intensity of the stimuli. It 
appears that the skin is more or less 
continuously sensitive to tactile and 
thermal stimuli, exhibiting peaks and 
valleys of sensitivity. 


CUTANEOUS SENSATION 


It is no longer profitable to hypothe- 
size encapsulated end organs which 
are sensitive to specific types of energy 
applied to the skin. Another organiz- 
ing principle must be found to account 
for qualitative differences in sensations 
derived from cutaneous stimulation. 
The filaments in which these nerves 
terminate appear morphologically alike, 
although it cannot be maintained that 
they are identical (Sperry, 1950; 
Weiss, 1947). On the other hand, 
there seems to be little reason to as- 
sume that the differences are suffi- 
ciently great to account for the’ wide 
variety of sensory experiences derived 
from their stimulation. Since there is 
no reason, at present, to assume that 
they are different, it would be expected 
that the terminals of these cutaneous 
fibers would share a common adequate 
stimulus, regardless of the tissue in 
which they end. Evidence has been 
presented (Nafe & Kenshalo, 1958; 
Nafe & Wagoner, 194la, 1941b) which 
indicates that movement is this com- 
mon adequate stimulus. Differences in 
the qualities of cutaneous sensations 
are based upon differences in the prop- 
erties of the tissue innervated. Tactile 
sensations are aroused by and during 
movement of dermal and epidermal 
tissue. Thermal sensations are also 
aroused by tissue movement, although 
in this instance, the tissue is the smooth 
muscle composing the walls of the 


cutaneous’ vascular 


1942). 


system (Nafe, 


The Tactile Sense 


Two components exist in the appli- 
cation of any mechanical stimulus to 
the skin. These are a static com- 
ponent (or magnitude of displace- 
ment), and a dynamic component (or 
velocity and acceleration). Accelera- 
tion has not been well investigated, 
although other work (Hubbard, 1958) 
seems to indicate that it is, of little sig- 
nificance. 

The earlier theories concerned with 
the nature of the adequate stimulus 
for the tactile sense considered the 
static component of the mechanical 
stimulus to be the relevant aspect of 
that stimulus; thus, there is the pres- 
sure theory (Weber, 1846), the gradi- 
ent theory (Meissner, 1859), and the 
tension theory (von Frey & Kiesow, 
1899). von Frey and Kiesow formu- 
lated their theory on the basis of stim- 
uli involving very small areas; that 
is, stimuli with radii of less than 0.1 
millimeter. They found that tension 
(weight per unit radius) was related 
to the threshold of sensation for these 
very small stimuli. On the other hand, 
for larger stimuli, greater than 0.4- 
millimeter radius, they were able to 
relate only the rate of application of 
the stimulus to the threshold of sensa- 
tion. Apparently, von Frey and Kie- 
sow were so convinced of the impor- 
tance of the static component of the 
mechanical stimulus that they ignored 
the rate factor involved in the stimuli 
of larger radii. 

Nafe and Wagoner (194la, 1941b) 
were the first to record the motion of 
a mechanical stimulus in producing a 
tactile sensation. Among the recorded 
variables of weight, area, time to cessa- 
tion of the sensation, and the rate of 
stimulus movement at the cessation of 
the sensation, a relationship was es- 
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tablished between weight and the rate 
of stimulator movement at the time 
of cessation of the tactile sensation. 
The rate of movement varied inde- 
pendently of the area at the time of 
cessation of sensation; hence,. any as- 
pect of the stimulus which involves this 
variable, e.g., pressure or tension, must 
be considered to be of no importance 
in producing and maintaining a tactile 
sensation. In addition, they found that 
depth of deformation of the tissue was 
a function of the applied pressure and 
must also be rejected as the stimulus. 
In other words, they were not able to 
relate any of the static aspects of the 
stimulus to the point at which cessation 
of the sensation occurred. As a result, 
they concluded that the dynamic com- 
ponent was the relevant aspect of the 
stimulus; further, that cessation of the 
tactile sensation was the result of fail- 
ure of the dynamic component of the 
mechanical event to stimulate, rather 
than failure of the receptor to respond 
to the stimulus. The rate of tissue 
deformation had reached a subliminal 
value. 

To test this notion, Nafe and Wag- 
oner (1941b) lowered a weight of 
17.5 grams onto the skin is 8.75-gram 
increments. The subject reported a 
tactile sensation with the application of 
each increment and cessation of the 
sensation when the rate of stimulus 
movement became small. Further- 
more, when the second 8.75-gram 
weight was removed, the subject again 
felt a tactile sensation which disap- 
peared at approximately the same rate 
of tissue movement as before. They 
contend that had the receptors, rather 
than stimulation, failed, the subject 
would not have felt the application of 
the second weight, nor would he have 
felt its removal. 

In order to locate this effect in the 
peripheral receptor mechanism, rather 
than in the central nervous system, 


STIMULUS EVENT 


ae 


DISPLACEMENT 





VELOCITY 


Fic. 1. Hypothetical response in a nerve 
fiber to a stimulus. (If the tactile receptor 
responds to the static component of a stimu- 
lus, action potentials should occur through- 
out stimulation as shown in the second line 
labeled “Displacement.” The neural re- 
ponse to the dynamic component of the 
stimulus, in the third line, will last only 
during movement of the stimulus.) 


Nafe and Kenshalo (1958) repeated 
Nafe and Wagoner’s experiments us- 
ing a slightly modified version of the 
Nafe and Wagoner apparatus. In 
place of the human observer, an ani- 
mal, usually a rat, was used. Instead 
of a verbal report by the subject, action 
potentials were recorded from the 
nerve innervating the stimulated tis- 
sue. Simultaneous recordings of the 
stimulus action and the neural dis- 
charge were made on photographic 
paper. In most of the preparations, 
the tongue of the rat was stimulated 
and recordings were made from the 
lingual branch of the fifth cranial 
nerve. Some preparations were made 
of the shaved leg of the rat and records 
were made from the femoral nerve, 
while in other experiments, the skin 
of frog was used. 

Figure 1 shows the type of record 
which would be predicted if the recep- 
tors of the skin respond to either the 
static component (displacement) or 
the dynamic component (velocity) of 
the mechanical stimulus. 

Figure 2 shows a record of the stim- 
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TONGUE —LINGUAL~ RAT 
| 3-23-54 
8.75 GM. 


Fic. 2. Action potentials in the lingual 
nerve in response to weight applied to the 
tongue of rat. (Movement of the weight 
into the tissue is shown in the top line. This 
is comparable to only the left half of Figure 
1. In other experiments the weight was 
later lifted. Action potentials were also 
obtained during its removal—modified from 
Nafe & Kenshalo, 1958.) 


ulus resulting from an 8.75-gram 
weight sinking into the tissue of rat 
tongue, and the neural discharge 
which results. In this type of re- 


ISEC. , 





AT” 





cording, the stimulator follows the ad- 
justment of the tissue to the weight 
rather closely. It can be seen that 
the neural discharge which accom- 
panied application of the stimulator 
stopped before the tissue had com- 
pletely adjusted to the added weight. 
The stimulator was still moving into 
the tissue, in this instance at the rate 
of 0.019 mm/sec, when the neural dis- 
charge stopped. Records such as these 
establish the point that the cessation of 
sensation associated with a minimal 
rate of movement is a_ peripheral 
event. 

The question still remains as to 
whether the cessation of discharge is 
a function of the stimulus effectiveness 
alone, or if it is in part due to adapta- 
tion by the receptor. To test this, the 
stimulus was hydraulically driven into 
the tissue in two steps of equal veloc- 
ity, but the second step was of less 
magnitude than the first. If the cessa- 
tion of discharge were associated with 
receptor adaptation rather than a stim- 
ulus parameter, the neural discharge 
associated with the second step should 
be less than that of the first. On the 
other hand, if rate of movement is the 
determining condition, the discharge 
to the second step should be equal to 
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Fic. 3. 


hydraulically into the tongue tissue in two steps of equal velocity. 





LINGUAL NERVE—RAT 
4 4-26-55 


Action potentials in the lingual nerve of rat when the stimulator is driven 


(The course of the 


stimulus into the tissue is shown in the top line—modified from Nafe & Kenshalo, 1958.) 
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Fic. 4. 


Hypothetical cross section through skin showing mechanoreceptors at A and B 


in relation to the stimulator. 


or greater than that from the first. 
As can be seen from Figure 3, the 
second hypothesis is confirmed. The 
longer discharge associated with the 
second step is explained by greater 
mechanical stress in the tissue follow- 
ing the application of the first step, 


than prior to its application. Thus, in 
Figure 4, if it is assumed that the ter- 
minals of the fiber from which the rec- 
ord was made are at A, it is apparent 
that movement of tissue at A will be 
greater per unit movement of the stim- 
ulator when A is under mechanical 
stress than when it is not. The ob- 
servations of Loewenstein (1956) 
emphasize the importance of the me- 
chanical factors involved in the stimu- 
lation of skin receptors. He has re- 
ported that the duration of a single 
unit discharge in response to a stand- 
ard stimulus in frog skin is greater 
when the skin is placed under me- 
chanical stress than when it is flaccid. 
Our own observations of a decrease in 
the velocity of stimulus movement 
with increasing weights applied over 
a constant area, may be explained in a 
similar fashion. One might think of 
the relation of the stimulus and recep- 
tor as analogous to a mechanical gear 


train with a large amount of backlash. 
As the gears are put under mechanical 
stress, by loading, e.g., the use of 
spring loaded antibacklash gears, the 
output of the gear train follows that 
of the input more closely. Similarly 
the tissue at A follows movements of 
the stimulator more closely. 

To test these notions it seemed ap- 
propriate to use an unyielding tissue. 
The coupling between the stimulator 
and receptor would be closer and tissue 
readjustments to the stimulus action 
would be minimal compared to the 
rat’s tongue. Frog skin is suited for 
such an investigation. As can be seen 
in Figure 5, the neural discharge oc- 
curred only during the time that the 
tissue was being moved. The second 
interesting aspect is that when the 
stimulator was removed from the 
frog’s back a third series of neural 
impulses occurred. 

Movement of tissue appears to be 
the adequate stimulus for nerve termi- 
nals found in a variety of locations. 
Nafe and Wagoner (1941b) used the 
skin of the web between the thumb 
and index finger and just above the 
knee. Nafe and Kenshalo (1958) 
used the tongue and hind leg of rat, 
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DOR. CUT. FROG 
3 8-19-55 


Fic. 5. Action potentials in the dorsal 
cutaneous nerve of frog as a result of a 
stimulus driven hydraulically into the skin 
and then removed. (The course of the 
stimulus into the tissue is shown in the 
top line—modified from Nafe & Kenshalo, 
1958.) 


and frog skin. There appears to be 
little difference in the mechanism of 
stimulation of the nerve terminals 
which exist in these various locations. 
Differences which have been observed 
may be accounted for by differences in 
the mechanical properties of the tissue 
surrounding them. It has been known 
for some time that nerve terminals 
surrounding hair follicles respond only 
to movement of the hair (Adrian, 
1930). In addition, the works of Gray 
and Malcolm (1950), Gray and 
Matthews (1951), and Gray and Sato 
(1953) have demonstrated that the 
Pacinian corpuscle is also responsive 
to only the dynamic aspect of the 
stimulus—movement. 

According to this concept, tactile 
adaptation, in the sense of a protracted 
but diminishing sensation or nerve dis- 
charge resulting from a constant stimu- 
lus, becomes meaningless. Consider 
the diagram of Figure 4. The rate of 
tissue movement or adjustment at A 
will be much less than the rate of 
movement of the stimulator, but as it 
continues to sink into the tissue, the 


rate of tissue movement will approach 
that of the stimulator. If it is as- 
sumed that a receptor is at A, it is 
clear that the stimulus to which the 
receptor responds is by no means con- 
stant, although the stimulus object, 
which the experimenter sees, is con- 
stant—it weighs 5 grams, or exerts 5 
grams/cm?® pressure. If a solid plate 
such as bone is placed immediately 
beneath the diagram, shearing forces 
are also produced because the tissue 
will be squeezed between the stimu- 
lator and the bone. The result of this 
added complication will be that reso- 
lution of these stresses will require 
much longer at A than at B. Ac- 
cording to the older electrophysiolog- 
ical literature, in comparing the neural 
discharges from receptors at A and B, 
it would have been said that A was 
slow adapting while B was fast adapt- 
ing, implying a difference in the neuro- 
logical functioning of each rather than 
in tissue mechanics. An important 
parameter of the problem is overlooked 
if the mechanical properties of the tis- 
sue are not also investigated. 


The Thermal Sense 


Before considering proposals for 
thermal receptors, the facts concerning 
thermal sensation should be examined. 
They impose rather stringent require- 
ments upon any theory concerned with 
the nature of thermal reception. The 
thermal sense is different from other 
senses. In the other senses absence 
of sensation is associated with near 
zero energy levels. Zero thermal en- 
ergy is at absolute zero, approximately 
— 273° C; yet, psychological zero or 
zero thermal sensation is at skin tem- 
perature—approximately 33° C. 

Stimulation of thermal receptors de- 
pends upon a temporal thermal gradi- 
ent. Within limits, thermal sensations 
occur only when the stimulus tem- 
perature change occurs at above a 
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minimal rate. Skin temperature is 
known to shift spontaneously by as 
much as 0.5° within the space of a few 
minutes (DuBois, 1941); yet during 
these shifts the subject is not aware 
of any thermal sensation. The ob- 
servations of Hensel (1950) indicate 
that skin temperature may be shifted 
within the range of 29°-36° C with- 
out producing a thermal sensation, 
provided the change is at a rate of 
less than 0.007° C/sec. This appears 
to be the zone of complete thermal 
adaptation and hence specifies the 
limits of the shifting psychological 
zero. The shifting zero point accounts 
for the frequent observation that the 
thermal reception system in man is 
not a good absolute thermometer. 
While thermal sensations do not occur 
with large changes in skin temperature 
at a slow rate, threshold sensations oc- 
cur at a rate of 0.001° C/sec for 3 
seconds (Hardy & Oppel, 1936). 

The thermal threshold is affected by 
the temperature of the skin (Hensel, 
1950). Ebaugh and Thauer (1950) 
have reported that the cool threshold 
is increased at skin temperatures above 
33° C. Kenshalo, Nafe, and Dawson 
(1960) confirmed their findings and, 
in addition, were able to show that the 
warm threshold decreases at skin tem- 
peratures above 34° C. 

Theories which depend on a spatial 
thermal gradient must be rejected. 
Ebbecke (1917) and Bazett and Mc- 
Glone (1932a) have suggested spatial 
gradient theories. The notions of 
Bernhard and Granit (1946) and von 
Euler (1947), that nerve itself may 
act as a thermal receptor, is also based 
on regional cooling of the nerve. Lele, 
Weddell, and Williams (1954) have 
extended this notion and suggested a 
Bolometer Theory which depends on 
warming and cooling of terminals in 
relation to their stem axons. Rejec- 
tion of spatial gradients as a means of 


stimulating thermal receptors requires 
some justification. Hensel and Zotter- 
man (1951c), using electrophysiolog- 
ical techniques, located the termination 
of a fiber responding to cold close to 
one side of cat’s tongue. They found 
that the receptor performs equally well 
regardless of which side of the tongue 
was cooled. The older work of Bazett, 
McGlone, Williams, and Lufkin 
(1932) on the prepuce, likewise, dem- 
onstrates the ineffectiveness of the 
direction of the spatial thermal gradi- 
ent as a means of stimulating warm 
receptors. Vendrik and Vos (1958) 
abolished a spatial thermal gradient 
altogether by warming the skin with 
a radar beam of 10-centimeter wave 
length, yet were unable to observe any 
appreciable difference in the warmth 
threshold from that obtained by infra- 
red radiation. Electromagnetic wave 
lengths of this order are quite pene- 
trating, according to them, and warm 
the deeper tissues as rapidly as the 
skin surface. If a spatial thermal gra- 
dient were necessary to stimulate the 
thermal sensing mechanism, they 
should have obtained a much higher 
thermal threshold using the radar 
beam than by infrared stimulation. 
The latter raises the skin surface tem- 
perature and depends upon conduction 
of the thermal energy to the deeper 
tissues. 

These experiments demonstrate the 
ineffectiveness of spatial thermal gradi- 
ents in depth as a stimulus for the 
thermal sensing mechanism. It is still 
possible that gradients across the sur- 
face of the skin may be of some im- 
portance, although it seems unlikely. 
Experiments such as those by Hardy 
and Oppel (1936) in which large 
areas of skin, e.g., the torso or torso 
plus face, were warmed uniformly by 
radiant energy should have produced 
thermal sensations only at the edges 
of the stimulus. Their subjects made 
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no such observation. Similarly, the 
thermal sensation aroused by laying a 
cold coin on the skin should occur only 
at the margin, but this is not consistent 
with common observation. 

Two types of receptor mechanisms 
have been proposed for mediating 
thermal sensations. One assumes that 
cutaneous nerve terminals are differ- 
entially sensitive, some responding 
only to mechanical energy, while 
others respond to thermal energy. 
The other maintains that endings of 
the skin respond to a common ade- 
quate stimulus—movement. Thermal 
reception would then be by means of 
tissue movement resulting from ther- 
mal stimulation. It is well known 
that smooth muscle is thermally re- 
sponsive, contracting when cooled and 
relaxing when warmed. Such move- 
ment induced in smooth muscle of the 
cutaneous vascular system would pro- 
duce impulses in the nerve fibers ter- 
minating among these smooth muscle 
elements. 

The notion that nerve terminals re- 
spond differentially to mechanical and 
thermal energy is supported by evi- 
dence from Hensel and Zotterman 
(1951b). They found that stem axons 
which show activity in response to the 
application of mechanical stimuli to 
their terminals do not respond well to 
thermal stimuli and vice versa. Their 
findings indicate that mechanically 
sensitive nerve terminals have stem 
axons of larger diameter (larger than 
8 micra); whereas, those responding 
to thermal stimuli are smaller (1.5-5 
micra). They interpret their findings 
as indicating that nerve terminals are 
inherently different in their respon- 
siveness to thermal and mechanical 
energy. However, such differences 
could occur as a result of differences 
in the mechanical and thermal prop- 
erties of the tissue in which the nerve 
terminals end. 


Several mechanisms have been sug- 
gested to account for the apparent dif- 
ferential sensitivity of nerve terminals. 
Bishop (1946) has related it to charac- 
teristics associated with the size of 
the stem fiber involved. Others have 
suggested that the nerve terminals 
differ in their sensitivity to chemical 
changes. Bazett and McGlone 
(1932b) proposed that thermal sensa- 
tions might be the result of alterations 
in blood acidity produced by warming 
and cooling. Consistent with this, 
Dodt (1956) has demonstrated a dif- 
ferential effect of CO, tension on the 
response of nerves stimulated ther- 
mally. In his Concentration Theory, 
Jenkins (1941) has also suggested a 
chemical intermediary. Bare nerve 
endings, according to Jenkins, would 
discriminate between a catabolic and 
an anabolic phase of some chemical 
reaction brought about by a change in 
temperature. Presumably bare nerve 
endings responding to mechanical 
stimuli would be insensitive to these 
changes. 

The suggestion of Hensel and Zot- 
terman (1951b) on the mechanism of 
the stimulation of cold receptors is 
reminiscent of the anabolic-catabolic 
hypothesis. Because of the rather 
complex shifts in the frequencies of 
impulses which they found in fibers 
responding to a temperature reduction, 
they suggest that there are two tem- 
perature dependent processes, prob- 
ably chemical, involved in the stimula- 
tion of cold fibers (Zotterman, 1959). 
The impulse frequency resulting from 
cold stimulation is a function of a dif- 
ference in the activity of these two 
processes. Although they do not spec- 
ulate on the neural response to warm- 
ing, one must presume at least one, and 
probably two, additional processes are 
necessary. Dodt (1953) has shown 
that the responses of mammalian A 
fibers to warming and cooling are 
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functions of their diameter. The dif- 
ferences do not appear great enough, 
however, to aid in understanding Hen- 
sel and Zotterman’s suggestion of a 
chemical process controlling the re- 
sponsiveness of certain fibers to ther- 
mal stimuli. 

Hypotheses which attempt to ex- 
plain differences between thermal and 
mechanical sensitivity on the basis of 
nerve terminals of different sensitivity 
must assume two different sets of cu- 
taneous nerve terminals, one set re- 
sponding to a mechanical form of 
energy, the other to thermal energy. 
If the difference between the two sets 
of terminals is a chemical one, the 
chemical reaction should be one which 
is in equilibrium at the mid-range of 
the thermal energy continuum, which 
may have its thermal equilibrium point 
shifted by 6 or 7 degrees if done suf- 
ficiently slowly, yet which will become 
unstable when changed rapidly by as 
little as 0.003° C (Hardy & Oppel, 
1936). To further complicate the 
theory, Hensel and Zotterman (1951b) 
find that of the terminals responding 
to thermal energy, some are activated 
only by an elevation, while others re- 
spond to a reduction in the thermal 
energy. Thermal fibers must now be 
endowed with even greater selectivity, 
either to discriminate between two 
different chemical reactions, one for 
warming and one for cooling, or be- 
tween the directions in which a re- 
versible chemical reaction is driven. 
In addition, the response of terminals 
to cooling is to be explained by the 
interacting of two temperature depen- 
dent processes, according to Zotter- 
man (1959). 

An alternative, involving fewer as- 
sumptions, is that nerve fibers and 
their terminals are largely alike; fur- 
ther that the differentiation necessary 
to account for thermal and mechanical 
responsiveness occurs as a direct result 


of the tissue in which the fibers termi- 
nate. It has been demonstrated that 
nerve terminals, similar to those ter- 
minating among the cells of the epi- 
dermis, end among the smooth muscle 
fibers of the cutaneous vascular net- 
work (Weddel et al., 1954; Woolard, 
1926). Smooth muscle is known to be 
thermally responsive, relaxing upon 
warming and constricting when cooled. 
Furthermore, Nafe and Kenshalo 
(1958) have demonstrated that move- 
ment of the nerve terminal either upon 
itself or in relation to its surrounding 
tissue best fits the facts concerning 
mechanical reception. 

Goldscheider (1886) was probably 
the first to suggest a relationship be- 
tween the cutaneous vascular system 
and thermal sensation. He noted in 
his biopsies of temperature sensitive 
spots the “striking and immediate 
proximity” of blood vessels to cold 
spots which he has previously mapped. 
Elaborating on this, Nafe (1934) 
pointed out the striking relationship 
between changes in the smooth muscle 
response to temperature and the sub- 
jective reports of stimulation by the 
same temperatures. The figures which 
Nafe (1934) cited applied to smooth 
muscle of the uterus and while they 
apply to smooth muscle in general, re- 
cent evidence indicates that the smooth 
muscle of the cutaneous vascular net- 
work, especially in the venules, 


show an extraordinarily prominent vaso- 
motor activity in contrast to other tissues, 
such as mesenteric structures, the wall of 
the intestinal tract, and skeletal muscle, 
where venous vasomotion is minimal and 
seemingly passive (Zweifach, 1959, p. 69). 


Furthermore, the venular tributaries 
in the skin are ten- to twenty-fold more 
sensitive to a fall in temperature of 
1-2° C than other tissues (Zweifach, 
1959). While the Vascular Theory 
is usually thought of as involving only 
the arterioles the responsiveness of 
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venular smooth muscle and that of the 
hair erector muscles should not be 
overlooked. 

The Vascular Theory accounts for 
the requirements of a thermal theory 
as outlined at the beginning of this 
section. It does not depend upon spa- 
tial thermal gradients to achieve stim- 
ulation and is uniquely suited for ex- 
planations involving temporal thermal 
gradients. Reference was made earlier 
in the paper to the fact that in order to 
stimulate nerve terminals located 
among the cells of the epidermis and 
hair follicles, minimal rates of tissue 
movement were necessary. On the 
assumption that nerve terminals are 
essentially alike, the same conditions 
of stimulation should apply to the 
nerve terminals ending in relation to 
the fibers of the smooth muscle of the 
cutaneous vascular system. The shift- 
ing psychological zero with no attend- 
ant thermal sensation is accounted for 
in terms of the constriction or dilation 
of the cutaneous vessels occurring 
slowly enough to be below the thresh- 
old rate of movement necessary to 
excite the nerve terminals in the ves- 
sel walls. Complete thermal adapta- 
tion represents the adjustment of vas- 
cular relaxation or constriction from 
one level to another; and the range 
through which Hensel (1950) has re- 
ported that complete thermal adapta- 
tion is possible, represents the range 
through which the smooth muscle of 
the cutaneous vessels can adjust and 
adopt a relatively stable end point. 
Outside of this range vascular smooth 
muscle would be unable to achieve a 
relatively stable adjustment to the 
temperature. 

Changes in the size of the thermal 
threshold as a function of the skin 
temperature should be expected since 
the latter would govern the degree of 
relaxation or constriction of the 
smooth muscle. Predictions of the 


direction in which the thresholds 
would go are difficult to make because 
of ignorance of the mechanical and 
physiological characteristics of afferent 
innervation of smooth muscle. 

In addition to fitting these require- 
ments, the theory also handles facts 
which have been somewhat perplexing 
to other theories. Thermal sensations 
occur under some circumstances which 
do not include a change in the environ- 
mental temperature. It has _ been 
noted, especially in the earlier work 
on the thermal sense, that mechanical 
stimulation of the skin may produce 
sensations of warm or cool (Nafe, 
1934). Smooth muscle is known, 
upon occasion, to relax or contract 
when prodded, depending upon its 
state of contraction at the moment 
(Evans, 1956). Thermal sensations 
resulting from mechanical stimulation 
can be explained in this manner if one 
assumes that they are mediated by 
smooth muscle of the cutaneous vas- 
cular system. Warm sensations ac- 
companying a blush occur as a result 
of chemically mediated vasodilation 
since there is apparently little if any 
reflex vasomotor control in the “blush 
area” (Burton, 1959). The chill ac- 
companying scratching a fingernail 
across a blackboard is the result of a 
reflexly aroused general vasoconstric- 
tion. 

It is not necessary to postulate dif- 
ferent types of free nerve endings, each 
responsive to one type of energy; nor 
is it necessary to conceive of two dif- 
ferent systems, one for warm and one 
for cool. Two possible explanations 
exist. One depends upon neural pat- 
terns while the other depends upon 
different fibers being stimulated, based 
upon the way in which they terminate 
among the smooth muscle elements. 
In the first instance neural patterns 
aroused by constriction could con- 
ceivably differ from those aroused by 
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relaxation. Patterns in other systems 
result in different sensations. For ex- 
ample, the peristaltic waves in the 
stomach passing from the cardiac to 
the pyloric valves are associated with 
hunger pangs, while those same con- 
tractions passing in the opposite direc- 
tion result in the entirely different 
sensation of nausea. The direction of 
travel is simply a matter of timing. 
In addition to central factors involved 
in thirst, the patterns aroused in the 
membranes of the throat when they are 
dry, in comparison to when they are 
wet, contribute markedly to this sensa- 
tion. The urge to micturate and to 
defecate are likewise results of pattern- 
ing of neural discharges. 

On the other hand, Hensel and Zot- 
terman (195lc), using electrophysio- 
logical methods on the cat’s tongue, 
have described two apparently differ- 
ent sets of nerve fibers, one respond- 
ing to warm and the other to cool 
stimuli. In striated muscle, such sys- 
tems occur; one responding when the 
muscle is contracted and the other 
when it is stretched. In the first in- 
stance the receptor elements are con- 
nected in series with the muscle fibers ; 
whereas, in the latter case they are 
connected in parallel with the muscle 
fiber. Whether the same type of end- 
ings exist in smooth muscle has not 
been investigated, although Fischer 
(1944) has described some afferent 
terminals in smooth muscle as “spray- 
like,” and others as “spindle-like.” 

Hensel and Zotterman (1951b) 
have described fibers responding to 
thermal stimulation as smaller (1.5-5 
micra) than those responding to tactile 
stimulation (8 micra and larger). 
Weddell and Pallie (1954) have 
shown that many of the fibers inner- 
vating the cutaneous blood vessels are 
of dorsal root origin. In addition, 
these fibers are among the smallest 
found in the skin, being, for the most 


part, less than 3 micra in diameter. 
The fibers ending in cutaneous vessels, 
then, correspond with the fiber spec- 
trum which Hensel and Zotterman 
have described as responsible for ther- 
mal sensitivity. 

One of the most frequently cited 
reasons for considering warm and cool 
to be mediated by different receptor 
mechanisms is the long reaction time 
of an observer to warm compared with 
cool stimuli. Bazett et al. (1932) and 
Hensel and Zotterman (195la) have 
used this difference in latency to cal- 
culate the depth of the receptors. 
These calculations are based on the 
assumption that the latency of response 
of the receptors are the same but occur 
at different depths. An alternate ex- 
planation, in terms of the Vascular 
Theory, is equally possible. Constric- 
tion is an active process requiring little 
time to accomplish. On the other 
hand, dilation is a passive response and 
requires a much longer time to reach 
a threshold rate of movement. 

Paradoxical cold provides little diffi- 
culty for the Vascular Theory. Para- 
doxical cold is a term ascribed to the 
situation in which a cold sensitive area 
is stimulated by means of a high tem- 
perature and produces a cold sensation. 
At high temperatures smooth muscle 
has constricting elements within a gen- 
erally relaxing system (Evans, 1956). 
It might be expected that an area 
normally responding to a cool stimulus 
would also produce a cool sensation 
when stimulated by a high tempera- 
ture. It is interesting to note that 
Zotterman (1953) reports fibers which 
normally respond to cooling, after a 
period of no response to temperature 
above 35° C. suddenly start to respond 
when stimulated by temperatures of 
45° C. and higher. 

Sensations of heat are the result of 
a mixed sensation of warm and para- 
doxical cold according to Alrutz 
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(1900). It is known (Evans, 1956) 
that when smooth muscle is warmed 
to above approximately 45° C. ele- 
ments of the muscle begin to constrict 
while the main body of the muscle 
continues to relax. According to the 
Vascular Theory hot sensations result 
from the neural activity brought about 
by the constricting elements in the 
generally relaxing smooth muscle of 
the cutaneous arterioles. 

The question arises as to how the 
sensorium distinguishes between tissue 
movements induced by mechanical 
stimuli and those induced by thermal 
stimulation. Evidence has already 
been presented to indicate that there 
is strong likelihood that different af- 
ferent nerve fibers respond to mechan- 
ical stimulation than to thermal. Ter- 
minals ending in the cutaneous 
vascular system, because of their 


smaller size and probably also because 
of the mechanical properties of the 


tissue of blood vessels, do not respond 
as readily to mechanical stimuli, as 
those which terminate among the cells 
of the epidermis, dermis, and in rela- 
tion to hair follicles. There is no con- 
tention that nerve or its terminals may 
not be stimulated directly by a tem- 
perature change, but it has never been 
demonstrated that their temperature 
sensitivity is sufficient to account for 
thermal sensations in the skin. The 
coding system is based on different 
fibers being activated by different 
types of energy as the result of the 
mechanical and thermal properties of 
the tissue in which their terminals end. 
Further differentiation is accomplished 
in the spinal cord. Descriptions of 
the neuroanatomy of the spinal cord 
(Ranson & Clark, 1959) maintain that 
fibers serving touch travel in the fas- 
ciculi gracilis and cuneatus as well as 
in the ventral spinothalamic tract, 
while those allied with thermal sensa- 
tion and pain ascend the cord pre- 


dominantly in the lateral spinothalamic 
tract, although there is probably con- 
siderable intermingling of fibers among 
the various tracts (Rose & Mont- 
castle, 1959). Fibers from these tracts 
terminate in different brain stem nu- 
clei but their connections are beyond 
the scope of this paper. 

Aside from the fact that the theory 
can account for a large number of the 
facts concerning tactile and thermal 
sensitivity it has the added advantages 
of simplicity and parsimony. It can 
serve as a convenient model within 
which to organize additional facts as 
well as bringing into sharp relief gaps 
in present knowledge. Furthermore, 
such a model allows predictions and 
suggests direct experiments which 
should be done. 


SUMMARY 


Qualitative theories requiring spe- 
cialized encapsulated end organs or 
specialized peripheral nerve fibers to 
account for the various qualities (pres- 
sure, warm, and cold) of cutaneous 
sensation fail to present direct evidence 
that such specialization exists in 
peripheral nerves or in their termina- 
tions. The Quantitative Theory, dis- 
cussed here, maintains that qualities of 
cutaneous sensation are, in part, a 
function of the mechanical and thermal 
properties of the tissue in which the 
cutaneous afferent nerves terminate 
and variations in the temperal and 
spatial patterns of the afferent nerve 
discharge. The latter variations are 
manifest in the frequency and duration 
of the impulses, and the area over 
which they arise as well as the relative 
number of fibers actuated within the 
area. The Quantitative Theory con- 
sists of four postulates. 

1. Nerves which end freely, with- 
out encapsulation of any sort, are the 
primary neurological elements present 
in hairy skin capable of mediating 
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sensations derived from normal stim- 
ulation of the skin. Nerves ending 
freely in the skin terminate in relation 
to different tissues. 

a. Some fibers branch and termi- 
nate among the dermal and epidermal 
cells of the stratum granulosum. 

b. Others, and probably some 
branches of those terminating as in 
(a) form a complex net about the hair 
follicles. 

c. Smaller fibers branch and ter- 
minate among the smooth muscle ele- 
ments in the walls of the cutaneous 
vascular system and perhaps the hair 
erector muscles. 

2. The terminals of nerve fibers 
supplying the skin are functionally 
similar, differing primarily by the tis- 
sue in which they terminate. Being so, 
they share a common adequate stim- 
ulus which is movement of the ter- 
minals either upon themselves or in 
relation to their immediately surround- 
ing tissue. 

3. Smooth muscle responds to ther- 
mal changes, in general, relaxing when 
warmcd but constricting when cooled. 

4. The thermal and mechanical prop- 
erties of the tissue in which the ter- 
minals end determine which fibers will 
respond to mechanical stimulation and 
which will be thermally sensitive. Dis- 
tinction between thermal and tactile 
sensations depends upon the central 
connections which the fibers make. 
Qualitative differences in sensations 
derived from various mechanical stim- 
uli are the result of differing patterns 
of neural excitation. Thus stroking a 
smooth surface results in different 
movement patterns in the tissue and 
hence different patterns of action po- 
tentials from stroking a rough surface. 
Distinction of warm and cool sensa- 
tions results from either different pat- 
terns of action potentials caused by 
vasoconstriction as compared to vaso- 
dilation, or different fiber terminals 


responding to vasoconstriction than to 
vasodilation. 
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This paper reports an attempt to 
apply a simple association learning 
model to the analysis of the influence 
of response and training variables on 
paired-associate learning. The issues 
under investigation are old ones but 
have not been resolved satisfactorily 
in the past. With the aid of the ele- 
mentary learning model, the problems 
are posed clearly and, the data willing, 
adequately resolved by the use of a few 
simple and intuitively compelling as- 
sumptions about learning. 

The first problem that led to this 
investigation concerns the relationship 
betwen the number of response alter- 
natives (N) and error rate in paired- 
associate learning. Experimental re- 


sults (Noble, 1955; Riley, 1952) are 
in agreement in showing that the num- 
ber of errors subjects make before 
reaching some criterion of learning is 
greater the larger the number of re- 


sponse alternatives. There is little 
agreement, however, as to the inter- 
pretation of this fact. 

Two possible factors could be in- 
volved. First, the effectiveness of a 
reinforced trial in increasing perform- 
ance (i.e., the learning rate constant) 
may be influenced by N; and second, 
N may influence the probability of be- 
ing correct by sheer guessing on items 
that are yet unlearned. It is a reason- 
ably safe assumption that N has the 
second effect on chance guessing. 
Previous data are unclear on whether 
N also influences the first factor, the 


1 This investigation was supported in part 
by a grant, M-3849, from the National In- 
stitute of Health, United States Public 
Health Service. 


effectiveness of a reinforced trial. This 
is not an easy question to answer since 
it is difficult to separate the effects of 
these two factors in the data. Guess- 
ing occurs only on unlearned items but 
there is no way to tell by direct ob- 
servation just how many items have 
been learned and how many have been 
guessed correctly on any given trial. 
Moreover, there appears to be little 
hope that more refined or ingenious 
experimental procedures will enable 
us to unconfound these two factors so 
that we may crucially test the hypothe- 
sis that N affects the learning rate. 

This is the type of situation in which 
a theoretical model of learning can 
make a strategic contribution. Indeed, 
without the aid of some formal model 
of the learning process, the question 
of the effect of N can neither be posed 
clearly nor answered clearly. With 
the aid of a theory one can make 
suitable allowance for the guessing 
factor and thus make an assessment 
of the contribution of learning and 
guessing at every stage of the ex- 
periment. 

The model to be presented has been 
formally treated in more detail in a 
previous paper (Bower, 1961). Here 
the theory will be presented informally 
and only those implications relevant 
to the present discussion will be intro- 
duced. For a more formal statement 
of the axioms and theorems, the reader 
may consult the prior report. The 
basic notion of the model is the as- 
sumption that each stimulus item and 
its correct response become associated 
on an all-or-none basis. Considering 
a single item, it can be in either of 
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two states at the beginning of each 
trial: conditioned to its correct re- 
sponse or not conditioned. If the item 
is conditioned at the beginning of a 
trial, then the correct response occurs. 
If the item is not conditioned, then 
the probability of the correct response 
depends somewhat «non the experi- 
mental procedure. In experiments by 
the writer, the su.,ccts were told the 
N responses (Integers 1, 2,... N) 
available to them and were told to 
respond on every trial regardless of 
whether they knew the correct answer. 
If the N numbers occur equally often 
as the to-be-learned responses to the 
items, then the probability that the 
subject will guess correctly on an un- 
learned item is 1/N on the simplest as- 
sumptions ; * correspondingly, his prob- 
ability of guessing incorrectly is 


2The arguments to be developed apply 
only to those conditions in which the re- 
sponse alternatives are immediately avail- 
able to the subject and he is permitted time 
to give a relevant response on each trial 
before the terminal event (e.g., reinforce- 
ment) occurs. This is the procedure most 
familiar to those who run animal subjects 
in choice situations. A more frequent pro- 
cedure in human learning experiments has 
been to give the subject a fixed time inter- 
val in which he may respond, and the ter- 
minal event (e.g., correct response in paired 
associates, next item in a serial list) follows 
after that time interval regardless of whether 
the subject makes a relevant response. To 
remark on the sociology of experimenters, 
there seems to be an empirical correlation 
between the use of experimenter vs. sub- 
ject-controlled exposure time and whether 
the relevant responses per se (as well as the 
S-R associations) must be learned in the 
experimental situation: when the responses 
must be learned,: experimenters usually con- 
trol exposure time; when the response al- 
ternatives are immediately available, either 
via instruction or construction, the subject is 
allowed to control exposure time. Clearly, 
however, there is no necessary entailment 
between these procedural aspects, and one 
may expect the empty cells of the 2x2 
contingency table to be filled by future ex- 
perimentation. 


1—1/N. The following discussion of 
the model is oriented specifically to- 
wards such an experimental procedure. 

With the theory formulated in this 
way, one cannot uniquely specify the 
subject’s state of conditioning on a 
given item from the knowledge that 
he made a correct response, since this 
correct response may have come about 
by guessing on an unlearned item. 
Thus, the theory does not imply that 
the first correct response will be fol- 
lowed by correct responses on all sub- 
sequent trials. However, if the sub- 
ject makes an error, then we can make 
a determinate inference that the item 
was not conditioned at the beginning 
of the trial. 

We have introduced the notions of 
the two states of conditioning and the 
probability of the correct response 
given the item’s state of conditioning 
at the beginning of each trial. We 
assume that each item in initially un- 
conditioned and that the effect of suc- 
cessive reinforced trials is to provide 
repeated opportunities for the item to 
become conditioned. The single 
parameter of the theory is the learn- 
ing rate constant (c) which represents 
the probability that an unconditioned 
item becomes conditioned as the result 
of a single reinforced trial. The prob- 
ability that a reinforced trial fails to 
condition the correct response to an 
unlearned item is 1—c. The prob- 
ability that an item is still not condi- 
tioned after m reinforced trials is 
(l1—c)". If ¢ is larger than zero, 
then this probability approaches zero 
as m becomes large; that is, given a 
large number of reinforced trials, it 
it certain that the item will become 
conditioned on some one of these re- 
inforced trials. When the item be- 
comes conditioned, the probability of 
a correct response jumps from 1/N 
up to 1. 

From the considerations above we 





36 Gorpon H. Bower 


may obtain an expression for gp, the 
probability that the subject responds 
incorrectly to a given item on Trial n 
of the experiment. To obtain gq, we 
note that the probability that the item 
has failed to be conditioned during the 
preceding »—1 trials is (1 —c)*™"*; 
and if the item is not conditioned then 
the probability of guessing incorrectly 
is 1—1/N. Hence, the probability 
of an error on Trial n is given by the 
product of these two factors: 


qn = (1—1/N)(1—¢)"* [1] 


Using this elementary association 
model, the first question of this inves- 
tigation can be restated in a clear man- 
ner: Is ¢ independent of N? If c¢ is 
independent of N, then, given an esti- 
mate of ¢ obtained from one group, we 
should be able to predict in advance 
the performance of other groups 


trained with differing numbers of re- 
sponse alternatives by simply adjust- 


ing the N factor in Equation 1. To the 
extent that these free predictions are 
accurate, we would have good evidence 
for the assertion that N affects guess- 
ing probabilities but has no influence 
on learning rate per se. 

The second question prompting this 
investigation concerns the relation be- 
tween several different training condi- 
tions and performance in paired-asso- 
ciate learning. The two major condi- 
tions of training studied to date are 
the correction and noncorrection con- 
ditions. The more frequently used 
condition is the correction procedure 
in which subject is informed of the 
correct response on every trial regard- 
less of whether he responds correctly. 
In noncorrection training, the subject 
is told right or wrong depending on 
whether his response is correct or in- 
correct. In the following, we will use 
the word “reinforcement” in the gen- 
eral sense intended by Estes (1960), 
viz., as operations’ exerting certain 


general quantitative effects upon re- 
sponse probabilities. In paired-asso- 
ciate experiments, the reinforcing op- 
eration is that of informing subjects 
of the correct response or in some 
way ensuring that this response is the 
last one to occur in the presence of a 
stimulus item. With this interpreta- 
tion, then, on every trial under a cor- 
rection procedure the correct response 
is reinforced. The  noncorrection 
training procedure differs in that the 
correct response is reinforced only on 
those trials when it occurs; since all 
incorrect responses are followed by 
the experimenter simply saying 
“Wrong,” no explicit reinforcement 
operation is involved. We will refer 
to such trials as nonreinforced trials. 
Within the model, an account of 
the noncorrection procedure takes the 
form of specific assumptions about the 
formation of associations following re- 
inforced and _ nonreinforced _ trials. 
Considerations of parsimony lead us 
to assume that the effect on condition- 
ing of a reinforced trial is the same 
whether the reinforcement occurs in 
the context of a correction or noncor- 
rection procedure, viz., the probability 
that an unlearned item becomes condi- 
tioned to its reinforced response is c, 
the same constant as in the case of 
the correction procedure. A new as- 
sumption that is required concerns the 
effect of a nonreinforced trial, when 
the experimenter says “Wrong” fol- 
lowing an error. The assumption that 
has been made elsewhere (Thorndike, 
1932) and that is made here also, is 
that a nonreinforced trial in this situa- 
tion results in no net change in the 
probability of a correct response. This 
assumption is not offered as a uni- 
versal interpretation of nonreinforce- 
ment; however, within the specific sit- 
uation to which the model will be 
applied (i.e., the alternative responses 
are equiprobable initially and a num- 
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ber of different S-R events occur be- 
tween successive presentations of any 
given stimulus) there is some basis for 
thinking that the no change assumption 
may be approximately correct. 

From these assumptions about the 
noncorrection procedure, a difference 
equation can be derived expressing 
the change in the average probability of 
an error from Trial n to Trial n+ 1: 


Qn = Qnl9n] 
+(1—qn)[(1—c)an] [2] 


The first term on the right-hand side 
of Equation 2 exhibits the assumption 
of no change (i.¢., dna =n) when 
the subject makes an error on Trial n, 
with probability qg,; the second term 
shows that if a success occurs (with 
probability 1—q,), then with prob- 
ability c the item becomes conditioned 
(if it is not already) and with prob- 
ability 1 — c the reinforcement was in- 
effective and no change occurs on that 
particular trial. An explicit solution 
of Equation 2 is not available. How- 
ever, Miller and McGill (1952; see 
also Bush & Mosteller, 1955, p. 181), 
have derived the following recurrence 
relation for Equation 2 (the constants 
have been changed appropriately) : 


Qn = (1—1/N) [1 — (1 —¢)**] Qns 
+(1-—1/N)(1-—c)™* [3] 


Equation 3 can be used to compute 
successive values of the average error 
probability once N and ¢ are known. 
In general, the predicted learning 
curve is § shaped, a feature which is 
consistent with the expectation of a 
low rate of improvement early in train- 
ing when the subject is trying to dis- 
cover the correct response. The point 
of inflection of the $ curve will be 
positively related to N. These fea- 
tures seem in qualitative agreement 
with noncorrection results reported by 


Noble (1955), although his experi- 
mental procedure was more complex 
than the one under present considera- 
tion. 

At this point in the analysis, Experi- 
ment I was carried out with four 
groups of subjects learning the same 
list of 10 paired associates. The vari- 
ables were the number of response 
alternatives (the first three or first 
eight integers) and correction vs. non- 
correction training procedure. For 
brevity, the four groups will be re- 
ferred to by the symbols 3-C, 8-C, 
3-NC and 8-NC, where 3 or 8 repre- 
sent the number of responses, and C 
and NC designate correction and non- 
correction training, respectively. <A 
more extensive discussion of the pro- 
cedure and results of Experiment I 
will be deferred till later; it will be 
sufficient here to note one critical result 
of Experiment I since it was this fact 
that led to Experiment II. The fact 
was that strong evidence was obtained 
to support the assumption of no change 
in success probability following a non- 
reinforced trial in the noncorrection 
training groups; that is, with either 
three or eight response alternatives, 
saying “Wrong” followed a subject’s 
incorrect response had no effect on 
his probability of success on the next 
trial. 

This no change result supports our 
earlier assumption about nonreinforced 
trials, but how are we to understand 
it? On rational grounds, one might 
expect that subjects would tend to 
eliminate a response which was fol- 
lowed by “Wrong.” For example, if 
subjects in the 3-NC group tended to 
eliminate their first erroneous re- 
sponse, then the probability of the 
correct response on the next trial 
should be around one-half or at least 
greater than one-third. However, the 
results of Experiment I clearly showed 
that this increase did not occur, and 
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we seek some explanation for why it 
did not occur. 

An explanation may be found per- 
haps by attending to the responses 
evoked in the subject by saying to him 
“Wrong” after he has responded, e.g., 
“three.” The reason for this inquiry 
is that, holding to a strict contiguity 
interpretation for the formation of as- 
sociation, it is these terminal reactions 
to the stimulus which have an oppor- 
tunity to become conditioned. Often 
these implicit reactions to “Wrong” 
are primarily emotional; in this re- 
gard, it may be reported that subjects 
frequently volunteered the information, 
“T know I’ve been getting that item 
wrong, but I can’t remember what 
number I said last time.” Clearly, if 
the subjects’ implicit reactions are pri- 
marily emotional, then we may expect 
no change in the recorded response 
probabilities on the next trial. How- 
ever, if subjects implicitly react to 
“three is wrong” with the response 
“one or two is correct,” when the con- 
tiguity interpretation would imply that 
the probability of response “three” 
would decrease and the probability of 
responses “one” and/or “two” would 
correspondingly increase. 

According to this interpretation, the 
noncorrection procedures in Experi- 
ment I did not insure responses of the 
form, e.g., “one or two is correct” 
after the experimenter said “Wrong” 
to “three.” For one of the groups in 
Experiment II, conditions were ar- 
ranged to insure the occurrence of re- 
sponses in this form following errors. 
There were three response alterna- 
tives and subjects were instructed that 
one and only one number was correct 
for each nonsense syllable. If, for 
example, the subject responded with 
“three,” instead of saying “Wrong,” 
the experimenter said “one or two is 
correct.” It should be noted that, in 
a formal sense, the subject gains no 


more information about the correct re- 
sponse with this procedure than when 
the experimenter says “Wrong,” as in 
the noncorrection procedure. Com- 
parisons of the results of this proce- 
dure with the standard noncorrection 
procedure are given below. 

The second condition run in Experi- 
ment II was aimed at a slightly differ- 
ent question. The question was 
whether there is some more basic way 
of specifying reinforcement contingen- 
cies in this situation rather than merely 
listing them, correction and noncorrec- 
tion. The formulation proposed here 
is that this basic variable is the de- 
gree to which the experimenter speci- 
fies the correct response following an 
error by the subject; to describe it in 
another way, the variable is the size 
of the subset of alternatives within 
which the one correct alternative is 
said to lie. The correction and non- 


correction procedures occupy the ex- 
treme poles of this dimension; in the 


correction procedure, the experimenter 
uniquely specifies the correct response ; 
in the noncorrection procedure, when 
the experimenter says “Wrong,” he 
implicitly specifies “one of the other 
N —1 alternatives is correct.”* To 
illustrate the construction of inter- 
mediate values of this training dimen- 
sion, suppose there were eight response 
alternatives and that the subject re- 
sponds incorrectly with “one” to a 
given item; then the experimenter 
might say “three or six is correct,” or 
“six or eight or two or five is correct,” 
and so on. For the second group in 
Experiment II, with eight responses, 
the experimenter said two numbers 
following errors. The subjects were 
instructed that one of the numbers was 
correct and that the other number was 
a distractor, but there were no cues 

3 However, see the related discussion 


above concerning the subject’s reactions to 
“Wrong.” 
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as to which number was the correct 
one. Of course, if the subject gave 
the correct number, the experimenter 
indicated this to him. 

It is clear that such a continuum of 
training conditions can be constructed. 
It is also clear that variations in this 
training variable should produce 
graded variations in performance in- 
termediate between the two extremes 
produced by the correction and non- 
correction procedures. To describe 
some of the factors involved in this 
prediction, consider the reasons for ex- 
pecting the partial correction group 
listed above (call it the 8-P group) 
to make fewer errors than the 8-NC 
group from Experiment I. 

First, a subject in the 8-P condition 
is expected to make fewer errors be- 
fore his first correct response because 
there is at least some likelihood that 
the correct association will be formed 
following an error. Specifically, we 
assume that before the first correct re- 


sponse the probability that the item 
becomes conditioned following an er- 
ror is c/2, where c is the same learn- 


ing constant as before. In general, if 
the subject is told & alternatives, one of 
which is correct, then we assume that 
the probability of the correct associa- 
tion being formed is c/k on each trial 
before the first success. This formu- 
lation is equivalent to assuming that 
the subject selects at random one of 
the k possibly correct responses to 
rehearse on a given trial, with prob- 
ability 1/k he selects the correct re- 
sponse, and with probability ¢c re- 
hearsal of this correct response results 
in conditioning. 

The preceding analysis applies on 
trials before the first correct response 
occurs. After the subject has been 
once informed of the correct response, 
his probability of recognizing it among 
the two numbers the experimenter says 
(following a subsequent error, if any) 


will be greater than one-half. If the 
subject recognizes the correct number 
and then rehearses it, the effect is 
much the same as a reinforced trial. 
In general, we may let r represent this 
recognition probability following the 
first correct response; because of im- 
plicit rehearsal, r is also the prob- 
ability of a reinforced trial on incorrect 
trials following the first correct re- 
sponse. Specifically, we assume for 
the present case that r is unity; that 
is, following the first correct response, 
we assume that subject can recognize 
with Probability 1 the correct number 
among the two numbers that the ex- 
perimenter says following an error. 
If the subject recognizes the correct 
response and rehearses it, then with 
probability ¢ the conditioned associa- 
tion is formed. 

To summarize the discussion for 
the partial correction groups, we have 
assumed that prior to the occurrence 
of the first correct response the prob- 
ability of conditioning is c/2 on each 
trial; on the trial of the first correct 
response and those trials following the 
probability of conditioning is c. In 
a later section we suggest a number of 
ways to test the details of these as- 
sumptions, 

In the experiments to be described 
there were six independent groups; if 
the model and theoretical assumptions 
are correct, then the data from the six 
groups can be reproduced after esti- 
mating the constant ¢ from one of the 
groups selected arbitrarily. One can 
easily recognize the advantages of cast- 
ing our theoretical assumptions in ex- 
plicit form within such a model. Not 
only can we clearly pose the questions 
of how WN and its interactions with 
training conditions affect error rate, 
but we have also developed a concep- 
tual framework within which it is pos- 
sible to get an answer to these ques- 
tions. The basic learning parameter, 
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c, plays a central role in the theory 
and is not just a curve-fitting constant ; 
for a homogeneous population of sub- 
jects with the same learning materials, 
the underlying theory constrains ¢ to 
be the same for all conditions. If in- 
deed such parameter invariance ob- 
tains (i.e., if c can be transposed from 
one condition to the others), then a 
strong appeal can be made for the 
simplicity and power of the underlying 
theory within which ¢ derives its mean- 
ing. 
Tue EXPERIMENTS * 


The subjects, 88 Stanford under- 
graduates, were required to learn a 
list of 10 paired associates to a cri- 
terion of two successive correct runs 
through the entire list. The stimulus 
items were nonsense syllables of 0-5% 
association value chosen for low intra- 
list similarity. The responses were 
the first three or first eight integers, 


paired randomly with the stimuli for 


each subject. With three response 
alternatives, two of the three responses 
were used for three stimuli each and 
the third response, selected randomly 
for each subject, was paired with the 
remaining four stimulus items. With 
eight response alternatives, all eight 
numbers were first paired with eight 
randomly selected syllables for each 
subject, and then the remaining two 
syllables were assigned to different 
but randomly selected numbers for 
each subject. 

The subjects, run individually, were 
instructed to learn a list of 10 non- 
sense syllables and their associated 
number responses, the response al- 
ternatives being either one, two, or 
three (or one, two, ..., seven, or 
eight), and that the syllables and num- 
bers had been paired off randomly. 
They were asked to respond within 


#This experiment was carried out with 
the assistance of Takao Umemoto. 


2 or 3 seconds after the stimulus card 
was shown, and to guess a number if 
they didn’t know the answer. The 
result of this practice is that the ex- 
posure time cannot be specified ex- 
actly; however, it was noted that the 
subjects followed instructions and re- 
sponded within 2 or 3 seconds in the 
large majority of cases. 

For all three training conditions, if 
the subject gave the correct number to 
a card, the experimenter repeated it, 
e.g., “three is correct.” The three 
conditions differed on trials when the 
subject responded incorrectly: in the 
correction conditions, the experimenter 
said the correct number; in the non- 
correction conditions, the experimenter 
said “Wrong” following errors; and 
in the partial correction condition, the 
experimenter said two numbers fol- 
lowing errors, e.g., “one or three is 
correct.” The subjects in the partial 
conditions were informed that only one 
of the numbers the experimenter said 
would be correct and the other number 
was a distractor (chosen at random) 
and that half the time the correct num- 
ber would be said first and half the 
time it would be said second. The 10 
stimulus cards were shuffled for 15 
seconds between runs through the 
deck. The entire list was repeated 
until the subject went through two 
consecutive cycles without any errors. 

The numbers of subjects in each 
group were 14 in the 3-C, 8-C, and 
8-NC groups; 15 in the 3-P and 8-P 
groups; and 16 in the 3-NC group. 
Assuming that each item and each 
subject may be characterized by the 
same value of c, the data consist of 
140, 150, or 160 sequences of correct 
and incorrect responses for the vari- 
ous experimental conditions. Before 
the model could be evaluated, the 
learning rate constant, c, had to be 
estimated. The data from the 3-C 
group were selected for this purpose; 
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the least squares estimate of c = .218 
was obtained by fitting the learning 
curve (i., percentage correct vs. 
trials) for this group. 

The first predictions concern the 
average total errors per item before 
achieving the learning criterion. The 
predictions for the correction and non- 
correction groups were obtained by 
summing Equations 1 and 3, respec- 
tively, over trials. The predictions for 
the partial groups were obtained by 
calculating the average errors before 
the first success (using c/2 as the rate 
constant) and the average errors fol- 
lowing the first success (using c as the 
rate constant) and then adding these 
two numbers. The observed means 
(M) and standard errors of the means 
(oy) are shown in Table 1 along with 
the predicted means (P). 

It is clear from Table 1 that the 
theory performs adequately in predict- 
ing average total errors using the 
single estimate of c. In all six cases 
the predicted value falls within one 
standard error of the observed mean.° 
The differential effect of N upon error 
rate is amplified as one proceeds from 
the correction through the partial to 
the noncorrection training procedures. 

A second indication of the goodness 
of fit of the theory is shown graphically 
in Figures 1 and 2 which present ob- 
served and predicted values for cumu- 
lative errors as a function of trials. 
The smooth curves connect the pre- 
dicted values while the unconnected 
points are the empirical values. The 
predicted values for the correction and 
noncorrection conditions were obtained 


5The standard errors for the 3-C and 
&-C groups can be predicted by the theory 
and turn out to be somewhat larger than 
the observed values (predicted values were 
.24 and .30 for 3-C and 8-C groups, respec- 
tively). Deriving comparable variance pre- 
dictions for the partial and noncorrection 
conditions presents complex mathematical 
problems which have not been solved to date. 


TABLE 1 
AVERAGE TotaL NUMBER OF ERRORS PER 
ITEM OBSERVED AND PREDICTED 








Response alternatives 





Training 

condition 3 8 
M = 2.94 
P = 3.03 
om = .19 


M = 3.74 
P = 3.88 
om= .21 


M = 5.65 
P = 5.59 
om = .34 





M = 3.95 
P = 4.00 
om = .25 


M = 5.95 
P = 5.80 
om = .35 


M = 9.42 
P = 9.54 


om = .61 


Correction 


Partial 


Noncorrection 





by cumulating successive values of 
error probabilities calculated from 
Equations 1 and 3, respectively. The 
theoretical curve for the partial cor- 
rection conditions is not expressible 
in a simple equation but successive 
values can be obtained. The first sev- 
eral values will be derived here to il- 
lustrate the procedure. Define Q, as 
the probability that the item is not yet 
conditioned by the beginning of Trial 
n. The probability of an error on 
Trial n would then be (1 — 1/N)Q,. 
The first several values of Q, for 
the partial correction conditions are: 


by assumption 


Qi=1 
= 51-9 + (1-4) 
x (1-5) 
= yl -or+(1-5) 


x (1 _ a —o) 


c 


2 


[4] 
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Fic. 1. Average cumulative errors per item plotted as a function of trials for subjects 
having three response alternatives. (The smooth curves represent predicted values; the 
dots near a curve represent the corresponding observed values.) 
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TRIAL 
Fic. 2. Average cumulative errors per item plotted against trials for subjects having 
eight response alternatives. (The smooth curves represent predicted values; the dots near 
a curve are the corresponding observed values. ) 
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The value of Q, is the sum of two 
probabilities: (a) that a correct guess 
occurs on Trial 1 but conditioning 
failed to occur with probability 1 — c, 
and (b) that subject guessed incor- 
rectly on Trial 1 and fails to condition 
the correct response with probability 
1—c/2. The value of Q, is the sum 
of the joint probabilities of three 
events: (a) correct on Trial 1, fails to 
condition on Trials 1 and 2; (b) error 
on Trial 1, success on Trial 2, fails to 
condition after either trial; (c) errors 
on both trials, not conditioned on 
either trial. The general rule for ob- 
taining Q, is that prior to the first 
correct response the conditioning prob- 
ability is c/2 and that on the trial of 
the first correct response and there- 
after the conditioning probability is c. 
By multiplying these Q, values by 
1 — 1/N and summing them, the pre- 
dicted curves for the partial correction 
conditions are obtained. As Figures 
1 and 2 show, for five of the six 
groups the fit of predicted to observed 
values is satisfactory; the correspond- 
ence to the 8-NC points would not be 
impressive were it not for the fact 
that the same c constant was used in 
generating all the curves under appro- 
priate boundary conditions. 
Equations 2 and 3 for the noncor- 
rection conditions were derived with 
the assumption that no change resulted 
from nonreinforced trials. To the 
extent that these equations adequately 
describe performance in the noncor- 
rection conditions, the assumption of 
no change is supported. However, 
additional data bearing upon this as- 
sumption may be obtained by analysis 
of the early trials before learning has 
occurred. Beginning with the first 
trial, consider the sequence of proba- 
bilities of a correct response follow- 
me OS, f,'2, . consecutive errors. 
As soon as a subject responds cor- 
rectly to a given item, that item is 


TABLE 2 
PROBABILITY OF A SUCCESS FOLLOWING n 
CONSECUTIVE ERRORS FROM THE FIRST 
TRIAL FOR THE NONCORRECTION 
Groups 


3-NC 


34 
31 
.28 
31 
31 
37 





dropped from later computations. If 
the randomization of responses to 
stimuli has been effective, then the first 
value of this series should be close to 
1/N, the probability of guessing the 
correct number by chance. Moreover, 
if the assumption of no change is cor- 
rect, then the probabilities of a success 
following 1, 2, 3, .. . consecutive 
errors should remain around 1/N for 
the noncorrection groups, deviating 
from this value only by random sam- 
pling fluctuations. The results of this 
analysis are shown in Table 2 for the 
noncorrection groups. It should be 
realized that going down the column 
these estimates become progressively 
less reliable because the number of 
cases involved (beginning with 160 or 
140) decreases. These series of esti- 
mates for both three and eight response 
alternatives reveal no significant trends 
away from the first value. Hence, 
the results provide additional support 
for the assumption of no change fol- 
lowing nonreinforced trials in the non- 
correction ccnditions. 

An analysis related to that above 
can be given for the correction and 
partial correction groups. Because the 
success rate was higher for these 
groups, there was insufficient data to 
obtain reliable estimates of single-trial 
probabilities such as shown in Table 2 
for the noncorrection groups. How- 
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TABLE 3 


PREDICTED AND OBSERVED CONDITIONAL 
PROBABILITIES OF SUCCESS FOLLOWING 
AN ERROR ON THE PRECEDING TRIAL 


Response alternatives 


Te . . ees Mis 
raining 
condition 3 8 


| Observe Predict Observe Predict 
Correction | 48 . | ae. ae 
Partial 
Before first | 
success 
After first 
success 


ever, this difficulty may be circum- 
vented by considering over all trials 
the probability of a success given an 
error on the preceding trial. For the 
correction group, this probability 
should be a constant, c+ (1—c)p, 
where /, is the probability of guessing 
correctly. For the partial correction 
conditions, the probability of a suc- 
cess following an error should be c/2 
+(1—c/2)p. before the first suc- 
cess and should be ¢c + (1 — c)p, after 
the first success. Such predictions are 
very sensitive to small changes in pp. 
Our efforts to randomly assign re- 
sponses to stimuli were directed to- 
wards making p, be close to 1/N in 
value. However, with even a mod- 
erate-sized sample, small deviations of 
the actual probability of correct guess- 
ing inevitably occur. Hence, a fairer 
test of the model (rather than a test 
of the randomization procedure) is 
provided by predictions based on em- 
pirical estimates of p,. Using the 
first-trial estimates of ~, (which on 
the whole were close to 1/N), the 
predicted probabilities of success fol- 
lowing an error were obtained and are 
compared with the observed values 


H. Bower 


in Table 3. The correspondence be- 
tween predicted and observed values is 
quite satisfactory; we may note that 
the rather complex assumptions about 
conditioning in the partial correction 
conditions receive substantial support 
from these data. 

Another way to test the assumptions 
of the model involves predicting the 
number of errors before the first suc- 
cess. Since for the partial and non- 
correction groups the stochastic proc- 
ess is not complicated prior to their 
first success, it is possible to predict 
both the mean and variance of this 
statistic for these groups. Define F, 
to be the number of errors before the 
first success, and p, to be the prob- 
ability of a correct guess. Then it 
can be shown (Bower, 1961) that the 
mean and variance of F, for the non- 
correction groups are: 


_1=% 


ie fe 
Po 


’ | ae pe? : [5] 
Similarly, for the correction groups, 
the mean and variance of F, are: 


M Sl 
hy + (1 — po)’ 


V= M+ M*(1 — 2c) 


[6] 


The expressions for the partial cor- 
rection groups are the same as in 
Equation 6 except with ¢ replaced by 
c/2. 

The predicted values of the mean 
and standard error of F, for the six 
groups are shown in Table 4 along 
with the observed means and standard 
For the noncorrection groups, 
fo was estimated as the average of the 
values in Table 2 since the theory says 
those values represent random fluc- 
tuations around pp». 

Inspection of Table 4 shows that in 


errors. 
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the six cases the predicted mean F, 
is within a standard error of the ob- 
served mean, and the predictions of 
the standard errors are also close to 
the observed standard errors. In gen- 
eral, the average F, increases with N 
and the differential effect of N is 
amplified under the partial and non- 
correction training conditions. 

It should be noted that at an em- 
pirical level N has its primary influ- 
ence upon errors before the first suc- 
cess, and has practically no influence 
upon errors following the first success. 
The reader may convince himself of 
this fact by subtracting the mean F, 
values in Table 4 from the mean total 
error values in Table 1. At first 
glance, this observation seems to in- 
validate the model by showing that N 
only influences the discovery stage but 
not the fixation stage of learning after 
the first correct response has occurred, 
In fact, however, theoretical predic- 
tions of average errors following the 
first success correspond closely to the 
observed values; of course, this must 
be so since the theory predicts F, and 
total errors reasonably well. The rea- 
son we expect errors following the 
first success to be relatively constant 
over JN is that, according to the theory, 
a sampling bias is involved in com- 
paring the three and eight alternative 
conditions at this point. Errors fol- 
lowing the first success will occur only 
for items whose first correct response 
came about by guessing, and for these 
guessed correct items we expect more 
errors with eight than with three re- 
sponse alternatives. Let gy represent 
the probability that with N alternatives 
the first correct response comes about 
by guessing and let ey represent the 
average number of subsequent errors 
for an item whose first correct re- 
sponse occurred by guessing. Then 
the expression for the average number 
of errors following the first correct 


response (call it wy) is: 


Wy = grey + (1 — gy) °0 
= 9nen [7] 


That is, a proportion gy of first cor- 
rect responses come about by guess- 
ing and we expect an average of ey 
more errors for these items; a propor- 
tion 1—gy of the first correct re- 
sponses come about by conditioning on 
the preceding trial and for these items 
we expect zero subsequent errors. Ac- 
cording to the theory, e, will be larger 
than e,; however, g, will be smaller 
than g,; that is, given a first correct 
response, it is more likely to have 
occurred by guessing when N is three 
than when N is eight. Therefore, the 
fact that the products g,e, and g,e, are 
approximately equal is neither surpris- 
ing nor a refutation of the theory. 


DISCUSSION 


The overall results provide strong 
support for the validity of the model 
and the specific hypotheses about ex- 


TABLE 4 
OBSERVED AND PREDICTED MEAN AND 
STANDARD ERROR OF THE NUMBER 
OF ErRrorS BEFORE THE FIRST 
SUCCESS 


Response alternatives 


Training 
condition 3 8 


M = 1.33 M = 2.25 
F = 1,39 F 37 
ou = .092 
or = .125 


M = 2.12 
F .16 
om .192 
oF 191 


M = 2.17 
F = 2.15 
om .201 
or = .206 


Correction 





Partial 
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perimental variables. The data of six 
independent groups were adequately 
reproduced using a single estimate of 
the learning rate constant obtained 
from one of these groups. The fact 
that in 24 independent predictions the 
model came close to the data is suf- 
ficient justification for exploring fur- 
ther consequences of the theory. 

The initial problem that led to this 
investigation was whether the number 
of response alternatives could be 
shown to affect learning rate on rein- 
forced trials in addition to contributing 
to differential error probabilities due 
to guessing on unlearned items. The 
data supported the assumption that 
learning rate on reinforced trials was 
a constant independent of the number 
of response alternatives, and that the 
effect of N upon error rate could be 
attributed to differential guessing 
probabilities on unlearned items. It 
should be noted that these conclusions 
are valid only for the experimental 
procedure used here in which the re- 
sponse alternatives are immediately 
available to the subject. For alterna- 
tive procedures in which the subject is 
required to learn the responses (e.g., 
three or eight nonsense syllables) as 
well as the S-R associations, a number 
of complicating factors enter to ob- 
scure the picture and prevent resolu- 
tion of the basic problem. 

The second problem of this study 
was to account for the effect on per- 
formance of several training proce- 
dures. One may conceive of the cor- 
rection, partial correction, and non- 
correction procedures as varying in 
the degree to which the correct re- 
sponse is specified following an in- 
correct response by the subject. This 
variable in turn determines the prob- 
ability that the correct response is re- 
inforced following incorrect responses. 
Thus, in the correction condition, the 


reinforcement probability is 1 on every 
trial; under partial correction, the re- 
inforcement probability is 1/k before 
the first correct response and is es- 
sentially 1 afterwards due to a high 
recognition probability ; under noncor- 
rection, the probability that the cor- 
rect response is reinforced is essen- 
tially zero following errors. The data 
supported these hypotheses and gave 
additional support to the assumption 
that when the correct response was re- 
inforced, the probability of condition- 
ing was a constant, c, the same for all 
three conditions. 

In addition to confirming these spe- 
cific assumptions about N and training 
conditions, the results lend support to 
the learning model in which these as- 
sumptions are embedded. In this one- 
element model, the probability of the 
correct response can have only two 
values, 1/N or 1. The stimulus ele- 
ment begins in the unconditioned state 
and each reinforced trial provides an 
opportunity for the element to become 
associated in all-or-none fashion with 
the correct response. When condi- 
tioning finally occurs, response prob- 
ability jumps from 1/N to 1. Discon- 
tinuous learning theories have been 
contrasted frequently with “response 
strength” theories which assume that 
an associative factor (habit strength 
in Hull’s theory, proportion of condi- 
tioned elements in Estes’ linear model ) 
increases in a cumulative manner with 
successive reinforcements. According 
to a strength theory, response prob- 
ability make take on a large (possibly 
infinite) number of values ranging 
from 1/N up to 1. It is frequently 
difficult to distinguish this theory from 
the one-element model since they pre- 
dict the same average learning curve 
over a group of subjects and items. 
Indeed, the present results on average 
errors could be fit about equally well 
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by the linear model. Since the dis- 
cussion has been oriented towards the 
one-element model, perhaps it would 
be appropriate to record a few extra 
results which favor this model. For 
these purposes, let us consider two 
sequential statistics for which the two 
models deliver qualitatively different 
predictions. 

The first statistic is the average 
number of errors (to perfect learning) 
following an error that occurs on Trial 
n of the experiment. The one-element 
model makes the rather counterintui- 
tive prediction that this average num- 
ber of subsequent errors is a constant 
independent of the trial number on 
which the leading error occurs. Thus, 
if we observe an error on Trial 10 we 
predict the same number of subsequent 
errors as if we had observed that error 
on Trial 1. The point of the matter is 
that when an error occurs on Trial n 
we know that the item was not condi- 
tioned before Trial nm, and we can set 
the clock back to Trial 1 as far as the 
model is concerned in predicting future 
errors on that item. In contrast, the 
associative strength approach (speci- 
fically, the linear model) predicts that 
the number of errors following an 
error on Trial m is a decreasing func- 
tion of n; that is, the greater the num- 
ber of reinforced trials before a par- 
ticular error, the higher the associative 
strength at that time, and hence the 
fewer the number of subsequent errors 
before perfect learning. 

To obtain a sizable sample on 
which to test this critical point, data 
from 48 subjects in another 10-item 
experiment ® (with 
N = 2) were pooled with the 3-C and 
8-C groups of the present experiment. 
The varying N’s will not affect the 


paired-associate 


6 These experiments were carried out 
with the assistance of Norman Karns and 
James Colloran. 


AVE. ERRORS FOLLOWING AN ERROR ON TRIAL an 








3 4 
TRIAL # 

Fic. 3. Paired-associate data: average 
errors following an error that occurs on 
Trial n, for n=1, 2,..., 6. (Data points 
are filled dots; predictions of linear and 
one-element models are indicated.) 


results on constancy vs. monotone de- 
creasing aspects of errors following an 
error on Trial n. Using this pool of 
760 response sequences (10 items for 
each of 76 subjects), the distributions 
of the number of errors following an 
error occurring on Trial 1, on Trial 2, 

. , on Trial 6 were obtained. The 
analysis was not carried beyond Trial 
6 since the number of cases involved 
was decreasing so that estimates of 
means would be more unreliable. The 
estimates of the average number of 
errors following an error on Trial n 
are shown in Figure 3. 

These data clearly favor the con- 
stancy prediction of the one-element 
model. Although the estimates fluc- 
tuate somewhat this can be attributed 
to sampling variability; even the 
largest difference (2.05 vs. 1.95) does 
not approach statistical significance 
(t=0.49). Also in Figure 3 is 
shown the rough order of magnitude 
of the numbers to be expected from 
the linear model. These predicted 
numbers are not exact because of dif- 
ferent learning rates in the two ex- 
periments that were pooled. How- 
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ever, the average c value was .25 and 
this value was used to obtain the 
values on the graph. Clearly the 
curve predicted by the linear model is 
quite discrepant from the data; ac- 
cordingly, the one-element model ap- 
pears to give a more adequate descrip- 
tion of these data. 

A second example of this constancy 
effect is shown in Figure 4 which pre- 
sents similar data collected in an ex- 
periment on verbal discrimination 
learning. In this experiment 34 sub- 
jects learned 20 items, an item con- 
sisting of two different nonsense syl- 
lables printed on a card. The subject’s 
task was to read off each card that syl- 
lable arbitrarily designated as correct 
for that item; the subject repeatedly 
went through the items in random 
order until he achieved two consecu- 
tive errorless trials with the entire list. 
Since response learning per se is not 


involved in this situation, the one-ele- 
ment model was expected to apply. 
Since there were two response alterna- 
tives on each card, we make the nat- 
ural assumption that the chance prob- 
ability that subject selects the correct 
syllable is one-half. Detailed com- 
parisons of predicted and observed 
statistics indicated excellent fit of the 
model; the results in Figure 4 are 
representative of the overall accuracy 
of the predictions for this set of data. 
An additional set of paired-associate 
learning data providing a comparison 
of the linear and one-element models 
has been reported in a previous paper 
(Bower, 1961); there again the one- 
element model was clearly superior to 
the linear model in predicting details 
of the data. 

A second measure suggested by 
Estes (1960) for differentiating the 
one-element and linear models con- 
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Fic. 4. Verbal discrimination data: 
Trial », for s=1, 2,..., 6. 
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average errors following an error that occurs on 
(Data points are the dots: predictions of linear and one- 
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cerns the probabilities of repetition 
and alternation of responses over a 
series of test trials following training. 
Consider a miniature experiment con- 
sisting of one or two reinforced trials 
followed by two test trials (T, and 
T,) on which the subject is not in- 
formed whether his response is right 
or wrong. The comparison differ- 
entiating the two theories is the prob- 
ability of correct on T, given a correct 
and incorrect response, respectively, 
on T,. According to the one-element 
model, the probability of correct fol- 
lowing incorrect on T, should be 
around the chance guessing level 
(1/N); the probability of correct fol- 
lowing correct on T, will be much 
higher, being somewhat less than one 
because we may expect some of the 
correct responses on T, to have oc- 
curred by guessing. The predictions 
of these conditional probabilities by 
the linear model depends upon 


whether there are substantial subject 


and/or item differences in learning 
rate. If for the moment we neglect 
such differences, as we have in testing 
the one-element model, then the linear 
model implies that the conditional 
probabilities of success following 
either a success or failure on T, will 
be equal. 

Two experiments will be reported to 
this point. The first, performed with 
the assistance of Sharon Gadberry, 
consisted of 36 subjects learning two 
10-item lists (nonsense syllables), the 
responses being the integers 1 to 10. 
Following two presentations of each 
S-R pair, the subjects received two 
test trials with the stimulus member 
of each pair. The pooled results of 
this experiment are shown in the first 
row of Table 5. Starting with 720 
cases (36 subjects on two 10-item 
lists), the probability of correct on T, 
was .586; of those correct, 89.6% were 


TABLE 5 


Resu_ts OF Two EXPERIMENTS ON THE 
PROBABILITY OF CORRECT ON T? 
FOLLOWING CORRECT AND 
FAILURE ON T, 








Per- Per- 
centage | centage 
correct 

after 
correct 


| 
er- 
—=—s = “ 
Experiment of 8 
correct 


on Ti 


I. Nonsense 
syllables, 10 
number re- | 
sponses 


. Noun-noun 








correct on T,; of those incorrect on T,, 
only 16.3% were correct on T,. 
These results support the one-element 
model. Two comments are required 
here: First, the value of .896 for cor- 
rect following correct on T, is about 
what would be expected if 10% of 
the correct responses on T, had oc- 
curred by guessing; secondly, the 
value of .163 for correct following 
incorrect on T, is a little higher than 
the chance level of .100; however, this 
is likely an artifact of assigning stim- 
uli aad responses in one-to-one corre- 
spondence so that when k items are 
learned the probability of guessing 
correctly on the unlearned items may 
be close to one in N — k instead of the 
a priori value of one in N. 

The difference between probabilities 
of success on T, following a success 
and failure on T, should be increased 
above those in the previous study if 
one requires the responses per se to be 
learned (e.g., as in word-word pairs). 
Under these circumstances, the chance 
level of guessing will be practically 
zero. Thus, if the subject fails to re- 
spond correctly on T,, the probability 
of being correct on T, is essentially 
zero; by the same reasoning, if a cor- 
rect response occurs on T, it is most 
likely to have come about by condi- 
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tioning so that the probability of cor- 
rect on T, for this item should be 
essentially unity if forgetting is neg- 
ligible in this situation. Row two of 
Table 5 reports the results of an ex- 
periment performed under these con- 
ditions with the assistance of Judith 
Slagter. Thirty-two subjects received 
three presentations on each of 10 noun- 
noun pairs (e.g., moon-pin) followed 
by two test trials. The percentage 
correct on T, was substantially higher 
than in the first experiment with non- 
sense syllable-number pairs. How- 
ever, the important points of interest 
are that the probability of correct on 
T, given a correct on T, is essentially 
unity, whereas the probability of cor- 
rect following failure on T, is nearly 
zero. 

One objection frequently raised to 
these foregoing comparisons is that if 
there are substantial subject and/or 
item differences in learning rates, then 


arguments based upon the pooled ag- 
gregate will tend to favor the one- 


element result. The point of the ob- 
jection is this: if the linear model 
holds, and if differences in learning 
rates lead to a distribution over sub- 
jects of response probabilities, then 
considering the entire aggregate the 
conditional probability of a success fol- 
lowing a success is expected to be 
higher than the conditional probability 
of success following a failure. This 
happens, according to the argument, 
because when we conditionalize upon 
the first success we are selecting pre- 
dominately those protocols from the 
upper end of the distribution of re- 
sponse probabilities. However, grant- 
ing that this argument is sound, it 
nevertheless helps not at all in res- 
cuing the linear model from such data; 
it simply shifts the focus of the argu- 
ment from conditional probabilities to 
joint probabilities of pairs of re- 


sponses on T, and T,. For conveni- 
ence in the following, let p,, represent 
the joint probability of a success on T, 
and a failure on T,, and let p,, be the 
joint probability of failure on T, and 
success on T,. It is a simple matter 
to show that if the linear model holds, 
then ~,, should equal p,, regardless of 
possible differences between subjects 
and/or items in learning rates or initial 
probabilities. Data from four studies 
previously published by Estes (1960, 
1961) show that the required identity 
of p,) and p,, fails to appear in any of 
the four studies. For example, in one 
study (Estes, 1960, Figure 2, p. 215) 
the value of p,, was .132 while p,, was 
.046; in another study (Figure 6, p. 
218) ~,. was .098 while p,, was .003. 
A variety of ad hoc hypotheses about 
individual differences to supplement 
the linear model have been considered 
in a paper by Estes (1961); the out- 
come of those investigations was that 
no simple ad hoc supplements could 
bring the linear model into corre- 
spondence with these elementary data 
which were collected to test it directly. 

The question about individual dif- 
ferences raised by the objection above 
can be answered by comparison with 
the variability expected from the 
model. Suppose this question is con- 
sidered in the context of a more ex- 
tended experiment such as that de- 
scribed for the 3-C group in the pre- 
ceding pages. A summary measure of 
an individual’s performance might be 
the total errors (7) he makes over the 
10 items. If we assume within the 
model that all subjects and items are 
characterized by the same learning 
rate constant, c, then a subject’s T 
score represents the sum of 10 values 
sampled randomly and independently 
from a distribution which allegedly is 
the same for all subjects and items. 
The theoretical variance of these T 
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scores may be calculated using our 
single estimate of c; this predicted 
variance may then be compared with 
the variance of the observed 7 scores 
by taking their ratio which will be 
distributed approximately as the F sta- 
tistic. If the observed variance of T 
scores is much larger than that pre- 
dicted, then we would tend to at- 
tribute the inflated variance to varia- 
tions in ¢ over subjects. This test has 
been applied to several sets of paired- 
associate learning data collected by the 
writer, including the data from the 
3-C and 8-C groups of the present 
study for which the variance of T 
could be predicted. With the popula- 
tion of college students who have 
served as subjects in these various ex- 
periments, this test statistic has al- 
ways yielded nonsignificant F values ; 
that is, the variance of T between sub- 
jects does not differ significantly from 
what would be expected on the basis 


of random sampling from the stochas- 
tic process assumed by the model to 


be common to all subjects. A similar 
statement can be made about anal- 
ogous tests for differences in item 
difficulty ; this outcome was expected, 
of course, since precautions were 
taken in selecting stimuli which ap- 
peared to be equal in intralist simi- 
larity and association value. Alterna- 
tive procedures for handling this ques- 
tion regarding subject and/or item 
differences in the RTT experiment 
were considered in a paper by Estes, 
Hopkins, and Crothers (1960). 

In concluding this report, it might 
be appropriate to add a few general 
comments concerning the issues under 
discussion lest some misunderstand- 
ings arise. First, it is misleading to 
cast the issues under discussion at the 
of all-or-none 
response-strength theories of learning ; 


level theories versus 


rather the point at issue is whether the 


appropriate model for the present ex- 
perimental situation is a two-state or 
a multistate process, where “state” 
here refers to a particular value of re- 
sponse probability. The continuous 
linear model is the limiting case of 
the class of multistate models. The 
all-or-none assumption about condi- 
tioning with respect to the available 
stimuli is not a differentiating feature 
of these two classes of models. Since 
the early writings of Guthrie (1930) 
it has been clear that the all-or-none 
conditioning assumption would imply 
gradual and cumulative changes in re- 
sponse probabilities provided there is 
sufficient variability in the stimulus 
samples from trial to trial. Within 
the framework of statistical learning 
theory, the feature differentiating the 
two classes of models is how many 
independent stimulus components one 
must assume to represent adequately 
the course of learning in this or that 
experimental situation. The two-state 
model proposed here for elementary 
association learning assumes that each 
item may be represented by a single 
stimulus element within the model and 
that this element can be in either of 
two states. That the number of ele- 
ments is the critical assumption is 
illustrated by the fact that the statis- 
tics in Figures 3 and 4 and in Table 5 
would not discriminate qualitatively 
between the linear model and a two- 
element model (i.e., where each item 
is represented by two stimulus com- 
ponents, with a randem one being 
sampled on each trial). For more ex- 
tensive discussions of small element 
models the reader may refer to a 
paper by Estes (1959) or a book by 
Suppes and Atkinson (1960). 
Secondly, it should be recognized 
that in this report we have demon- 
strated only that the one-element 


model adequately describes results 
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from elementary _ paired-associate 
learning experiments. The job of ex- 
tending the range of applicability of a 
theory is an empirical project which 
must proceed piecemeal. In large 
part, the success in applying the model 
to a given learning situation will de- 
pend upon the simplicity of the situa- 
tion and the degree of experimental 
control over stimulus variables. To 
cite a pertinent illustration: in training 
a rat to shuttle in response to a buzzer 
to avoid shock in a Mowrer-Miller 
shuttle box, conditioning is typically 
a gradual, “multi-stage” process (cf. 
Mowrer, 1960, p. 36); however, if 
experimental conditions are drastically 
simplified and precautions are taken 
to eliminate possible sources of inter- 
fering responses, then avoidance con- 
ditioning is a one-trial, “two-state” 
process (Maatsch, 1959). An anal- 


ogous illustration for the eyelid condi- 
tioning situation has been reported by 


Voeks (1954). 

In contrast to those situations in 
which a two-state behavioral process 
may possibly be achieved through ex- 
perimental control, there is a large 
number of common learning situation 
for which such precise stimulus con- 
trol probably can not be achieved, and 
it is unlikely that a simple two-state 
model could apply. These situations 
may be characterized generally as ones 
in which exposure to discriminative 
stimuli is subject-controlled; these sit- 
uations range from those in which the 
critical cues are proprioceptive stimuli 
from the subject’s current behavior, as 
in motor skill learning, to situations in 
which selected components of a com- 
plex stimulus array control behavior 
through the mediation of an overt or 
implicit observing response, as in com- 
plex concept learning. One example 
of this latter class would be that of 
a rat learning a black-white discrimi- 


nation in a T maze; the stimuli effec- 
tive at the moment of choice are mul- 
tiple and vary from trial to trial de- 
pending upon the subject’s vicarious 
trial and error behavior. For such 
situations more elaborate conceptual 
apparatus (cf. Audley, 1960; Bower, 
1959; Spence, 1960) is required to 
analyze the interaction between ob- 
serving behavior and _ instrumental, 
goal-directed behavior. 
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Testing the null hypothesis, Ho, against 
alternatives, H,, is well established and 
has a proper place in scientific research. 
However, this testing procedure, when 
it is routinely applied to comparing 
experimental outcomes with outcomes 
that are quantitatively predicted from a 
theoretical model, can have unintended 
results and bizarre implications. This 
paper first outlines three situations in 
which testing Ho has conventionally been 
done by psychologists. In terms of the 


probable intentions or strategy of the 
experimenter testing H» turns out to be 
an appropriate tactic in the first situa- 
tion, but it is inadequate in the second 
situation, and it is self-defeating with 
curious implications in the last situation. 


Alternatives to this conventional pro- 
cedure are then presented along with the 
considerations which make the alterna- 
tives preferable to testing the usual Ho. 


THREE APPLICATIONS OF Ho TESTING 

Probably the most common applica- 
tion of the tactic of testing Ho arises 
when the independent variable has pro- 
duced a sample difference or set of 
differences in the magnitude of the de- 


'The author is indebted to Arnold M. 
Binder of Indiana University whose argu- 
ments inspired him to make explicit some of 
the issues involved in using conventional 
analysis of variance procedures in testing the 
adequacy of a theoretical model. As this 
paper went through various revisions over a 
period of time the writer is correspondingly 
indebted to a number of supporting agencies: 
the Graduate Research Committee and the 
College of Letters and Science of the Uni- 
versity of Wisconsin, the National Science 
Foundation, and finally to the Department of 
Psychology of the University of California, 
Los Angeles, his host during final preparation 
of the manuscript. 


pendent variable. Quantitative predic- 
tions of the size of the difference or 
differences are not available. The ex- 
perimenter wishes to know whether or 
not differences of the size obtained could 
have occurred by virtue of the operation 
of the innumerable nonexperimental 
factors conventionally designated as 
random. He sets up Ho that the differ- 
ences are zero; chooses a significance 
level, a; determines the set of hypotheses 
alternative to Ho that he is willing to 
entertain, H;; selects an appropriate test 
statistic, t, F, x?, U, T, or the like; and 
proceeds with the test. Rejection of Ho 
permits him to assert, with a precisely 
defined risk of being wrong, that the 
obtained differences were not the product 
of chance variation. Failure of the test 
to permit rejection of Ho, which, un- 
fortunately, is commonly termed “ac- 
cepting’’ Ho, means that the obtained 
differences or greater ones would occur 
by chance with a probability greater than 
a. This situation is straightforward. 
The experimenter has limited aims. He 
has asked a simple question, and he has 
received a simple answer, subject only to 
those ambiguities which attend all ex- 
perimental and inductive inference. His 
tactics are admirably suited to his stra- 
tegic objective. 

Another common but less satisfactory 
instance of testing Ho arises when the 
results of pre-experimental matching or 
pretesting are to be evaluated. Here 
the experimenter has measured the 
dependent variable or some related vari- 
able before operation of the independent 
variable, and he devoutly hopes that 
the experimental and control groups are 
alike except for random differences. He 
is now relieved or chagrined, depending 
upon whether Ho is “accepted” or 
“rejected” as a consequence of his test. 
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Even if Ho is accepted his relief is tem- 
pered by some uneasiness. He knows 
that he has not proved, and indeed can- 
not prove, that Ho is “true.” His 
tactics in testing Ho seem to be appro- 
priate to the impossible strategic aim of 
proving the truth of Ho. Certainly, if 
he had a more reasonable aim he has 
adopted inappropriate tactics. Utilizing 
these tactics, the best he can do is to 
beat a strategic retreat, and if Ho is 
accepted he can perhaps point out that 
he has used a very powerful test and that 
if there were real differences they were 
most likely very small. Although psy- 
chologists have never to my knowledge 
done so, he might be able to go one step 
further and point out that his testing 
procedure would reject Ho a given per- 
centage of the time, say, 90%, if the 
“‘true’’ difference had been as little as, 
say, one-tenth of an SD. This sort of 
statement of the power of a test is a 
commonplace in acceptance inspection 
(Grant, 1952, Ch. 13). 

With the advent of more detailed 
mathematical models in psychology (e.g., 
Bush, Abelson, & Hyman, 1956; Bush & 
Estes, 1959; Goldberg, 1958; Kemeny, 
Snell, & Thompson, 1957) a new sta- 
tistical testing situation is arising more 
and more frequently. The specificity of 
the predictions and perhaps the whole 
philosophy behind model construction 
pose a different kind of statistical prob- 
lem than those faced by most psycho- 
logical investigators in the past. It 
seems obvious that as the use of models 
becomes more widespread a_ greater 
number of investigators will face the 
problem of evaluating the correspondence 
between empirical data points and pre- 
cise numerical predictions of these points. 
Unfortunately most of the procedures 
used to date in testing the adequacy of 
such theoretical predictions set rather 
bad examples. Probably the least ade- 
quate of these procedures has been that 
in which an Hg of exact correspondence 
between theoretical and empirical points 
is tested against H, covering any dis- 
crepancy between predictions and ex- 
perimental results. 

Most models predict a considerable 
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Fic. 1. Idealized situation involving the 
test of a theoretical function, Y’ = f(X). 
(Theoretical points, Y’;, are represented by 
open circles; obtained means, f. are repre- 
sented by solid circles.) 


number of different aspects of the data, 
and some of these aspects are predicted 
with greater success than others (Bush & 
Estes, 1959, Chs. 14, 15, 17, 18). We 
shall restrict our discussion to the pre- 
diction of values along a curve which 
might be a learning curve. An idealized 
version of such a typical situation is 
presented in Figure 1. Here, the de- 
pendent variable, Y, is plotted on the 
vertical axis against the independent 
variable, X, on the horizontal axis. The 
theoretical model has led to an expression, 
Y’ = f(X), giving a set of k theoretical 
predictions, Y’;, Y's, «++, ¥’%. The ex- 
periment has produced k empirical data 
points, a set of mean values, YP), Po, ---, 
Y,., corresponding to the values of the 
independent variable that were investi- 
gated, namely, X,, Xo, +++, Xx. Indi- 
vidual observations tend to form normal 
distributions about each of the Yj, and 
these normal distributions tend to have 
equal o’s for all data points. In further 
discussion we shall assume that inac- 
curacies in the manipulation of the 
independent variable, X, can be ignored. 
The problem now is to investigate the 
goodness of fit of the Y’; to the VP; or 
the correspondence between the Y’; and 
the Y;. 

The tactics oriented toward accepting 
Ho as corroborating the theory involve 
breaking down the jth individual obser- 
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vation from the general mean of all of the 
observations, as follows: 


Yi; — F = (Yi; — P,) 
+ (F; s Y’;) + (Y’; ae Y) (1) 


where Y;; is the jth observation in the ith 
normal distribution, and Y is the general 
mean of all observations. 


the total sum of squares may then be 
partitioned as follows: 


SStot = SSpev Est 
+- SSpev Theory + SStTheory [2] 


where SSpev Est is the sum of squares asso- 
ciated with the variation of individual 
measures from their means, SSpev Theory is 
the sum of squares associated with the 
systematic departures of empirical data 
points from the theoretical points, and 
SStheory is the sum of squares associated 
with departures of the theoretical points from 
the general mean of the whole experiment. 


If we suppose that the linear model for 
the analysis of variance holds, then: 


Yip = et Tit Dit aij 


[3] 


where u is the population mean for all Y;; 
over the specific values of the independent 
variable, X;; 7; is the departure of the 
“true” theoretical value of Y’; from yu; D; is 
the discrepancy of the “true” value of Y; 
from the true value of Y’;; and e; is a 
random element from a normal distribution 
with a mean of zero and variance, o/, 
for all i. 


For a fixed set of X; the Ts and Ds may 
be defined so that }°>7; = /D; = 0. 
Under He each D; = 0. Under H; some 
D;# 0, and the variance of the D,, 
op’ #0. This last variance may be 
termed the true variance of the dis- 
crepancies from the theory over the 
particular set of X; that was investigated. 

The foregoing is a conventional an- 
alysis of variance model, and the F ratio 
of the MSpevTheory divided by the 
MSpevEst provides an excellent and 
powerful test of Ho against H;. The 
number of degrees of freedom for SSpev Est 
will be k(m — 1) where n is the number 
of observations per data point, and the 
degrees of freedom for SSpey Theory will 
be k — mr, where mr is the number of 


degrees of freedom lost in the process of 
fitting the model! to the data. If this F 
is significant, we reject Ho, concluding 
that the discrepancies between the YF; 
and the Y’; are too great to be accounted 
for by the observed random variation in 
the experiment. In this conclusion we 
accept the 5% or 1% risk implied by 
our choice of a. 

Logical difficulties arise when F fails 
of significance. Ho remains tenable but 
is not proved to be correct. A tenable 
Ho provides some support for the theory 
but only in the negative sense of failing 
to provide evidence that the theory is 
faulty. To assert that accepting the Ho 
proves that the model provides a satis- 
factory fit to the data is an inaccurate 
and misleading statement. We may 
mean that we are satisfied, but others, 
especially proponents of other theories, 
will tend to regard our test as too lenient. 

Failure to reject He, instead of pro- 
ducing closure, leaves certain annoying 
ambiguities, but the tactics of testing 
this particular Ho imply a strategy that 
suffers from more serious defects that are 
readily apparent when the whole concep- 
tion of testing a theory is carefully con- 
sidered. To begin with, in view of our 
present psychological knowledge and the 
degree of refinement of available theo- 
retical models it seems certain that even 
the best and most useful theories are not 
perfect. This means, in terms of the 
analysis of variance model, there will be 
some nonzero D,’s. Ho, then, is never 
really “true.” Its “acceptance,” rather 
than “‘proving’’ the theory, merely indi- 
cates that in this instance the D;’s were 
too small to be demonstrated by the sensi- 
tivity of the experiment in question. 
The tactics of accepting Ho as proof and 
rejecting Ho as disproof of a theory lead 
to the anomalous results that a small- 
scale, insensitive experiment will most 
often be interpreted as favoring a theory, 
whereas a large-scale, sensitive experi- 
ment will usually yield results opposed to 
the theory ! 

Curiously enough, even rejection of Ho 
by means of a very stringent experi- 
mental test may be quite misleading as 
far as casting light on the adequacy of 
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the theory is concerned. If the D,’s are 
very small indeed the theoretical model 
may be a great improvement over any- 
thing else that is available and satis- 
factory for many purposes even though 
an extremely sensitive experiment were 
to reveal the nonzero D,'s. If our task, 
as scientists, were to test and accept or 
reject theories as they came off some 
assembly line the tactics of testing Ho 
could be made in a satisfactory manner 
simply by requiring that the test be 
“sufficiently” stringent. In fact, our 
task and our intentions are usually 
different from testing products; what we 
are really up to resembles quality control 
rather than acceptance inspection, and 
statistical procedures suitable for the 
latter are rarely optimal for the former 
(Grant, 1952, Chs. 1, 13). 


HyportruHEsis TESTING VERSUS 
STATISTICAL ESTIMATION 


An analogy will make clear the rela- 
tion between testing tactics and the in- 
tention of the tester. Suppose that I 
wish to test a parachute; how should I 
go about it? How I should test depends 
upon my general intentions. If I want 
to sell the parachute and am testing it 
only to be able to claim that it has been 
tested, and I do not care what happens 
to the purchaser, then I should give the 
parachute a most lenient, nonanalytic 
test. If, however, I am testing the para- 
chute to be sure of it for my own use, 
then I should subject it to a very strin- 
gent, nonanalytic test. But if I am in 
the competitive business of manufactur- 
ing and selling parachutes, then I should 
subject it to a searching, analytic test, 
designed to tell me as much as possible 
about the locus and cause of any failure 
in order that I may improve my product 
and gain a larger share of the parachute 
market. My contention is that the last 
situation is the one that is most analogous 
to that facing the theoretical scientist. 
He is not accepting or rejecting a finished 
theory; he is in the long-term business of 
constructing better versions of the theory. 
Progress depends upon improvement or 
providing superior alternatives, and im- 
provement will ordinarily depend upon 


knowing just how good the model is and 
exactly where it seems to need alteration. 
The large D,’s designate the next point 
of attack in the continuing project of 
refining the existing model. Therefore 
attention should be focused upon the 
various discrepancies between prediction 
and outcome instead of on the over-all 
adequacy of the model. 

In view of our long-term strategy of 
improving our theories, our statistical 
tactics can be greatly improved by 
shifting emphasis away from over-all 
hypothesis testing in the direction of 
statistical estimation. This always holds 
true when we are concerned with the 
actual size of one or more differences 
rather than simply in the existence of 
differences. For example, in the second 
instance of hypothesis testing cited at 
the beginning of this paper, where the 
investigator tests a pre-experimental 
difference, he would do better to obtain 
95% or 99% confidence interval for the 
pre-experimental difference. If the in- 
terval is small and includes zero, he 
(and any other moderately sophisticated 
person) knows immediately that he is 
on fairly safe ground; but if the interval 
is large, even though it includes zero, it is 
immediately apparent that the situation 
is more uncertain. In both instances Ho 
would have been accepted. 


TESTING A REVISED Ho 


Before turning to estimation pro- 
cedures that are useful in examining the 


correspondence between experimental 
outcomes and predictions from a mathe- 
matical model, I shall digress briefly to 
outline a_ statistical testing method 
which can legitimately be used in ap- 
praising the fit of a model to data as 
shown in Figure 1. Basically the sta- 
tistical argument in the proper test is 
reoriented so that rejection of Ho con- 
stitutes evidence favoring the theory. 
The new Ho is that the correlation be- 
tween the predicted values, Y’;, and the 
obtained values, Y;, is zero, after all 
correlation due to the fitting process has 
been eliminated. The alternative, Hy, 
against which Hp is tested is that there 
is a correlation greater than zero be- 
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tween theoretical and empirical points. 
The four simple steps required to obtain 
the necessary F test are as follows: 


1. Calculate t; = Y’; — Y for all iz. 
>i; = 02 

ya Calculate SScorrespondence == 
n(St;¥;)?/Zt?, where n is the number of 
observations upon which each FY; is based. 
Negative values of (2t;¥;) are treated as 
zero. 

3. Obtain M Scorrespondence = SScorre- 
spondence/"%7, Where m7, the number of 
degrees of freedom involved in fitting 
the theoretical points to the empirical 
data, will ordinarily be the number of 
linearly independent fitting constants 
in the mathematical expression of the 
model. 

4. Divide M Scorrespondence by MSpev Est 
to give Fcorrespondence Which has mr de- 
grees of freedom for its numerator and 
k(n — 1) degrees of freedom for its 
denominator, k being the number of F,. 
The test is one-tailed in the sense that 
negative values of Lt;¥; are treated as 
zero values, so that the probability 
values of the F distribution must be 
halved, an unusual procedure with F 
tests in analysis of variance. 


Following the above procedure, rejec- 
tion of Ho now means that there is more 
than random positive covariation be- 
tween predicted and obtained values of 
the dependent variable. 

This test is admirable in that it puts 
the burden of proof on the investigator, 
because a small-scale, insensitive experi- 
ment is unlikely to produce evidence 
favoring the model. Furthermore, if 
the model has any merit, the more sensi- 
tive the experiment, the more likely it 
is that a significant F, favoring the 


?In the unusual event where the-general 
mean of the observations, Y, is not used as a 
fitting constant for Y’ = f(X), the 4; must be 
computed as deviations from the mean of all 
the ¥’;, ¥’. The test will then be insensitive 
to discrepancies between Y’ and Y, and the 
interpretation will be somewhat equivocal. 
A separate test of Ho that Ypopulation equals 
Y’, is feasible, but here the experimenter is 
forced into the illicit posture of seeking to 
embrace Ho. 


theory, will be obtained. Actually, the 
test is extremely sensitive to virtue in 
the theory, and therefore in the case of a 
moderately successful model and a mod- 
erately sensitive experiment both this F 
and the one testing the significance of 
systematic deviations from the model 
(F = MSpev Theory/MSpev Est) will tend 
to be significant. This outcome is no 
anomaly; it merely indicates that the 
model predicts some but not all of the 
systematic variation in the data. In 
short, progress is being made, but im- 
provement is possible. The fact that 
simultaneous significance of both Fs, 
indicating general success and specific 
failures of a model, should be a common- 
place points up the necessity of turning 
to methods of statistical estimation for a 
more adequate examination of the work- 
ings of a theoretical model. 


PRACTICAL ESTIMATION METHODS FOR 
INVESTIGATION OF MODELS 


As is true of statistical tests, each 
method of statistical estimation has its 
advantages and limitations. In the 
investigation of the adequacy of theo- 
retical curves in psychology there are 
reasons to believe that the simpler 
estimation methods have practical ad- 
vantages over some of the more elegant 
procedures. To give a fairly complete 
view of the situation, methods of point 
and interval’ estimation of op? and of 
the individual D; will be described, and 
a brief evaluation of each method will 
be given. 

Estimating op. The variance of the 
discrepancies between the Y’; and the 
Y; condenses into a single number the 
adequacy of fit of the theoretical model. 
As such it is an excellent index for the 
evaluation of the model. The smaller 
the variance, ap”, the better the model, 
and vice versa. As an estimate of the 
size of the discrepancies one might ex- 
pect in future similar applications of the 
model, gp? is far more informative than 
any F test. Furthermore ap? is readily 
estimated in the case of homogeneity of 
the error variance, ¢2. The expected 
values of the relevant mean squares are 
as follows: 
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[4] 
[5] 


A maximum likelihood estimate of the 
variance of the discrepancies, ¢p’ is then: 


6p? = (MSpev Theory — MSpeveEst)/n [6] 


The accuracy of this estimator depends 
upon the number of degrees of freedom 
associated with SSpey Theory and SSpevy Est- 
The latter rarely poses any practical 
problem, but the former, in view of the 
predilection of psychologists for mini- 
mizing the number of data points, is 
quite critical. This is readily apparent 
when interval estimation of gp? is 
attempted. 

Bross (1950) gives a convenient method 
for accurate approximation of the fidu- 
cial interval for ap”, and in this case the 
fiducial and confidence intervals are 
essentially equal. The method will be 
outlined below for the 5% interval. 


Exp(MSpev Theory) = 72 + nop? 


Exp(MSpev Est) = o2 


1. Obtain ¢p* from Equation 6, above. 
(If the estimate is negative or zero, 
meaningful limits cannot be obtained.) 

2. Find: 

F 


- - —-—1 
Fo25(k—nr, k{n—1}) 
Leo 
F. Fo25¢k nr, @) 
Fo25¢k —nr, k[n—1)}) 
where: 


F = MSpev Theory/MSpev Est 


Fo2(k—nr, k{n—1}) iS the entry in the 2.5% 
F table (Pearson & Hartley, 1954) for 
m =k— nr and no = k[n— 1]; and 
Fo25(k—np, «) is the entry for my = k — nr 
and mz = ©, 


3. Find: 


L he F- Fos (etn—u, _. a 


PF: Foas¢kin—1), k—mr) _ 


F025 (0, k—nr) 


where Fo25(é{n—1], k-np) is the entry in 
the 2.5% F table for n, = k[n — 1], and 
ny = k— nr; Fowsce, kn) iS the entry 
for ny = ©, and mz = k — nr. 

4. The upper and lower limits are 
then Lép? and Lép’*, respectively. With 


less than 15-20 data points these limits 
will be found to be uncomfortably wide, 
a fact to bear in mind when designing an 
experimental test of a theoretical model. 
For example, in Figure 1, with 6 data 
points and two degrees of freedom for 
curve fitting, the limits might plausibly 
be 0-40, whereas with 14 data points the 
limits might be 0-12. 

Aside from the considerable variability 
in the estimate of op? which can be 
reduced by increasing the number of 
data points, there are two other impor- 
tant limitations to the use of estimates of 
the variance of the discrepancies in 
evaluating a model. First of all, the 
population value of op? is completely 
dependent upon the particular values of 
the independent variable, X, which are 
chosen for the test of the model. Choice 
of two different sets of Xs could well 
lead to two entirely different values of 
op’, and both of these values could be 
perfectly accurate. Secondly, although 
op* gives an over-all index of the ade- 
quacy of the model being tested, it 
condenses so much information into one 
measure that it does not permit pin- 
pointing the especially large D,’s so 
that they can be given proper attention 
in considering revision of the model. 

Estimating the D;. The individual D; 
may be estimated as points, orintervals 
may be established for the Dj;, collec- 
tively or individually. As before, each 
method has its good points and its 
limitations. 

Point estimation of the individual D,’s 
consists simply in comparing the indi- 
vidual data points, the Y;, with the 
fitted curve. It is a crude method, but 
it has served well in the past and repre- 
sents the beginning of wisdom. For 
example, in Figure 1, the model builder 
might well note that the first three data 
points lie below the curve and ask him- 
self if there is some special reason for 
this. He would also note that the great- 
est discrepancy occurs at Y5, where the 
neighboring discrepancies are in the 
other direction. The weakness of this 
simple method lies in the absence of a 
criterion which will assist the investigator 
in deciding which discrepancies should be 
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singled out for further attention and 
which may be disregarded because they 
are within the range of expected random 
variation. This defect is remedied by 
the interval estimation techniques. 
Probably the ideal method of interval 
estimation is that in which intervals are 
established for the whole curve im one 
operation by finding the 95% confidence 
band. The method takes the theoretical 
curve as a point of departure, and the 
result is a pair of curves above and below 
the theoretical curve, which will tend 
in the case of random variation to con- 
tain between them 95% of the data 
points. Points lying outside the band 
are immediately suspect; they are the 
most promising candidates for attention 
in the next version of the model. There 
are two practical difficulties with this 
method. First, homogeneity of the error 


variance, o,’, over all the X; is required. 
And secondly, because errors in estima- 
tion of each fitting parameter must be 
taken into account, for all but the sim- 
plest curves (Cornell, 1956, pp. 184-186) 
the bands may be difficult? to obtain. 


Although the method is elegant, in 
practice it will rarely represent sufficient 
improvement over the final method, 
given below, to justify its use. 

The last method seems to me to be the 
most useful and most robust and most 
flexible method. It can be widely ap- 
plied, and the relative ease of applica- 
tion, coupled with its ability to dis- 
criminate between significant’ and ran- 
dom discrepancies make it superior to 
the other estimation methods. It also 
possesses the homely virtue of being 
readily understood. In contrast to the 
preceding method, this one takes as its 
point of departure the empirical means, 


3 A sufficient estimate of the error variance 
of each parameter must be available and inde- 
pendent of the estimates of all other param- 
eters or else the covariances of all parametric 
estimates must be found and the theoretical 
function must have continuous first partial 
derivatives with respect to the parameters in 
order that the confidence bands may be found 
in the asymptotic case (Rao, 1952, pp. 207- 
208). Where an asymptote is involved in the 
fitting of the theoretical function, satisfactory 
independent estimators can rarely be obtained. 


and consists, simply, in computing the 
95% confidence limits for each of the F;. 
If there is homogeneity of variance, the 
error variance of each mean is taken 
simply as ¢2/n; in cases of suspected 
heterogeneity, each mean must have its 
own estimate of error variance. This 
will, of course, be the variance of the 
distribution of Yj; for each i, divided by 
n. When these limits have been ob- 
tained, attention is directed to instances 
where the theoretical curve lies outside 
the limits. In some cases, the investi- 
gator might choose to establish the 80% 
or 90% limits in order to direct his 
attention to less drastic departures of 
the experimental results from the model. 
Choice of an optimum level for the limits 
is hard to establish on a general a priori 
basis, but it is likely that limits narrower 
than the traditional 95% will be found 
more useful than the broader limits. 
Simple as this method is, it is hard to 
improve upon in actual practice. In- 
stead of giving an almost meaningless 
over-all acceptance or rejection of a 
model, it directs attention to specific 
defects, its functioning improves as the 
precision of the experimental test is 
improved, and the investigator can set 
the confidence coefficient so as to in- 
crease its sensitivity to defect at a cost 
of a fairly well-specified percentage of 
false positives or wild goose chases. A 
final and often crucial advantage is that 
the confidence intervals, based as they 
are upon the experimental means, can be 
obtained in cases where the form of the 
theoretical function does not permit 
satisfactory estimation of its parameters, 
and the analysis of variance and con- 
fidence bands methods cannot properly 
be applied. 


SUMMARY AND CONCLUSIONS 


In this paper I have attempted to show 
that the traditional procedure of testing 
a null hypothesis (Ho) of a zero difference 
or set of zero differences is quite appro- 
priate to the experimenter’s intentions or 
scientific strategy when he is unable to 
predict differences of a specified size. 
When theory or other circumstances per- 
mit the prediction of differences of 





THEORETICAL NoTES 61 


specified size, using these predictions as 
the valuesin Hgis tactically inappropriate, 
frustrating and self-defeating. This is 
particularly true when a theoretical curve 
has been predicted, and Ho is framed in 
terms of zero discrepancies from the 
curve. If rejection of Ho is interpreted 
as evidence against the theory, and 
“acceptance” of Ho-is interpreted as 
evidence favoring the theory, we find that 
the larger and more sensitive the experi- 
ment is, the more likely it will lead to 
results opposed to the theory; whereas 
the smaller and less sensitive the experi- 
ment, the more likely the results will 
favor the theory. Aside from this 
anomaly, which can be corrected by re- 
casting Ho in terms of a zero covariance 
between theoretical prediction and ex- 
perimental outcome, hypothesis testing 
as a statistical tactic in this case implies 
an acceptance-inspection strategy. Ac- 
ceptance-inspection properly involves ex- 
amination of finished products with a 
view to accepting them if they are good 
enough and rejecting them if they are 
shoddy enough. The theoretician is not 


a purchaser but rather he is a producer 
of goods in a competitive market so that 
his examination of his theory should be 
from the standpoint of quality control. 


His idealized intentions are to detect 
and correct defects, if possible, so that 
he can produce a more adequate, more 
general theoretical model. Because his 
ideal strategy is not to prove or disprove 
a theory but rather to seek a better 
theory, his appropriate statistical tactics 
should be those involving estimation 
rather than hypothesis testing. 
Examination of alternative techniques 
available for point or interval estimation 
of discrepancies between theoretical pre- 
dictions and experimental outcomes or 
the over-all variance of these discrepan- 
cies suggests strongly that estimation of 
the confidence intervals for the means 


found along a theoretical curve is the 
most practical and most widely appli- 
cable general procedure. Other writers 
have recently emphasized the values of 
various estimation as opposed to hy- 
pothesis testing techniques (e.g., Bolles 
& Messick, 1958; Gaito, 1958; Savage, 
1957) and it is hoped that considerations 
pointed out by them and points raised in 
this paper will be helpful to investigators 
who are in the process of examining 
theoretical models which lead to specific 
numerical predictions of experimental 
outcomes. 
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THE CONSTRUCTION OF Q SORTS: 
A CRITICISM 


DONALD M. SUNDLAND1?1 
Columbus Psychiatric Clinic 


use of Q sorts makes 
certain difficulties in 
item selection be emphasized. In this 
author’s opinion, these difficulties are 
such as to suggest the discontinuance of 
the use of this technique for research 
purposes, unless corrective measures are 
devised. 

The Q sort is a technique derived from 
Q-methodology practice (Mowrer, 1953; 
Stephenson, 1955). In using this tech- 
nique, comparison is made between an- 
swers to a large number of “tests” (items) 
from different persons (usually judges), 
or from the same person from different 
points of view: “self, ideal self, self as 
seen by mother, etc.” An implicit, under- 
lying assumption is that the tests are 


The widespread 
it important that 


not inherently correlated. Operationally 
speaking, if two judges agree that the 
quality “shy” is “very characteristic” of 
Person X, then their agreement about 
the qualities “bashful, modest, outgoing, 
extroverted, etc.” is, at least partially, 
predetermined by the inherent correla- 


tions between the items. Now this would 
be all right if all the items were about 
shyness or nonshyness, then agreement 
between judges would mean that they 
agreed on the shyness of X. Character- 
istically this is not the case. Items rela- 
tive to the broad range of personality 
traits are usually included in a Q sort. 
Under these circumstances the inclusion 
of items inherently related will lead to 
spurious correlations between Q sorts. 
This difficulty has been pointed to be- 
fore, but seemingly without appreciation 
of its seriousness. Stephenson (1955) 
wrote: “no set contained direct anto- 
nyms” (p. 257) suggesting that this was 


1The author expresses his thanks to 
Robert J. Wherry, Edwin N. Barker, and 
Shephard Liverant for their valuable com- 
ments on this paper. 
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something to be avoided. 
son? said: 


Later Stephen- 


you are quite right about ... the use of 
direct antonyms. Little is to be gained by 
using words like introversive and extro- 
versive in the same Q-sample, one of them 
is enough. ’ 


Cronbach (1953) said, in discussing cri- 
teria for the selection of items for QO 
sorts: 


statements, while logically bearing on the 
same domain, should represent a large num- 
ber of continua. Correlating persons seems 
to have no advantage if we present items 
which all fall into one scale (p. 380). 


Two types of statistical distortion are 
involved in this use of highly related 
items. The first is an inflation of the 
correlation coefficient, and the second is 
an inflation of the number of degrees of 
freedom. By using the total number of 
presumably unrelated items as the degrees 
of freedom, one markedly inflates the de- 
grees of freedom in testing the signifi- 
cance of the correlation if many of the 
items are, in fact, correlated. A rather 
obvious example would be using only 
items related to a single dimension. If 
30 items concerning a single trait were 
used to judge a person, the correlation 
between judges would be accurate and 
misleading at the same time. It would 
be accurate insofar as the judges’ agree- 
ment about the trait, but it would be mis- 
leading about the person insofar as this 
is the only trait being considered. This 
is pretty obvious; however, now let us 
intermingle these 30 monotrait items 
among 70 other items, these 100 items 
supposedly representing the broad spec- 
trum of personality traits. Further, let 


2W. Stephenson, personal communication, 


1957. 
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us suppose that two judges have a zero 
correlation on these 70 other items, but 
they have a perfect, 1.00, correlation on 
the 30 monotrait items. The result of 
this intermingling would depend upon the 
placement of the 30, agreed upon items. 
The author devised a set of data to ex- 
amine this problem. Seventy items were 
randomly distributed into the 11-cate- 
gory, approximately normal distribution 
shown in Table 1 and then minor adjust- 
ments were made to obtain an exactly 
zero correlation. The 30 agreed upon 
items were then intermingled in three 
ways. The first way was to place them 
in the extreme categories of the com- 
bined 100-item distribution shown in 
Table 1; the second was to place them 
in the middle categories; and the third 
was to distribute them proportionately. 
This procedure yields the correlations: 
0.76, 0.11, and 0.30. A correlation of 
0.26 for 100 items is significant at the 
0.01 level. Considering the monotrait 
items as really one item, this example 
shows two types of overestimation: the 
correlations found range between .11 and 
.76 instead of from .00 to .06, and the 
degrees of freedom used are 98 (100-2) 
instead of 69 (71-2). 

Let us look at another frequent use of 
the Q sort: self vs. ideal-self comparison. 
Where the monotrait items are rated in 
the same direction on both self and ideal- 
self Q sorts, the correlation will be over- 
estimated. Where the monotrait items 
are rated in the opposite directions, the 
correlation will be underestimated. This 
distortion would be augmented if the 
experimenter were comparing Q sorts 
before and after therapy and if the mono- 
trait items shifted their directions. For 
example, let us suppose that before 
therapy a patient sees himself as shy but 
as ideally not shy, and that after therapy 
he sees himself as not shy both actually 
and ideally. If the Q sort had many 
“shyness” items in it, the difference be- 
tween the pretherapy and posttherapy 
correlation would be doubly overesti- 
mated since the pretherapy correlation 
would have been underestimated and the 


TABLE 1 
Item DISTRIBUTIONS 








Number 
of 
Items 


Categories 





70 items 5 
100 items ie 7} 10) 10 





























posttherapy correlation would have been 


overestimated. 

It would seem that a possible hope for 
the comparison of persons, or of different 
concepts of a person, would be the use 
of items whose interrelatedness is known. 
Levy and Dugan (1960) have proposed a 
method which offers a possible system 
for determining the interrelatedness of 
items. An investigator adapting their 
design would ask the subjects, from the 
same population as his experimental sub- 
jects, to describe a photograph with his 
Q-sort items. Each subject would be 
given a different photograph. The items 
could then be intercorrelated and factor 
analysed into orthogonal components. In 
using the Q sort with his experimental 
subjects the investigator could then obtain 
factor scores and use these for his com- 
parisons, Since this is a somewhat 
laborious procedure it would seem to be 
desirable for investigators to come to 
some consensus about the items for two 
or three widely applicable @Q_ sorts. 
Short of the availability of independent 
items or items with known dependencies, 
any kind of Q-sort study is subject to 
this distortion. 


SUMMARY 


It has been pointed out that due to the 
interrelatedness between items which 
constitute a Q sort, investigators obtain 
spurious results due to an overestimation 
of the correlation coefficient and of the 
degrees of freedom in their Q sort. The 
difficulty of determining the correct mag- 
nitude of the correlation coefficient and 
the correct number of degrees of freedom 
to be used indicates the desirability for 
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the development of alternate methods for 
Q-type studies. 
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